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On a height he stood that looked towards 
greater heights. 
Our early approaches to the Infinite 
Are sunrise splendours on a marvellous verge 
While lingers yet unseen the glorious sun. 
What now we see is a shadow of what must 
come. 

Sri Aurobindo, Savitri 1.4 


How I wish I could show you the world 


through my eyes. 
Vivekananda 


Preface 


22 December 2012 marks the 125th birth anniversary of the Indian mathematician 
Srinivasa Ramanujan. Being largely self-taught, he emerged from extreme poverty 
to become one of 20th century’s most influential mathematicians. His story is a 
phenomenal “rags to mathematical riches” story. In his short life, he had a wealth 
of ideas that have transformed and reshaped 20th century mathematics. These ideas 
continue to shape mathematics of the 21st century. 

This book is meant to be a panoramic view of his essential mathematical con- 
tributions. It is not an encyclopedic account of Ramanujan’s work. Rather, it is an 
informal account of some of the major developments that emanated from his work 
in the 20th and 21st centuries. The twelve essays focus on a subset of his significant 
papers and show how these papers shaped the course of modern mathematics. 

These essays are based on lectures given by the authors over the years at 
the Chennai Mathematical Institute, Harish-Chandra Research Institute, IISER 
(Kolkata), IISER (Bhopal), IIT (Powai), IIT (Chennai), Institute for Mathematical 
Sciences (Chennai), and the Tata Institute for Fundamental Research (Mumbai) as 
well as Queen’s University, the Fields Institute, and the University of Toronto. The 
lectures were given so that the material is accessible to undergraduates and grad- 
uate students. We have striven to not be too technical. At the same time, we tried 
to convey some depth of the mathematical theories emerging from the work of Ra- 
manujan. Surely, it is impossible to be comprehensive in such a mammoth task. Still, 
we hope that the reader will see how the vast landscape of Ramanujan’s garden has 
blossomed over the past century. 


Toronto, Canada M. Ram Murty 
V. Kumar Murty 
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Chapter 1 
The Legacy of Srinivasa Ramanujan 


Mathematics enjoys the freedom of art and the precision of science. There is free- 
dom of combination of ideas and concepts, but there is also the precision of logic and 
the ring of truth. It is like a master symphony. The Soviet mathematician, I.R. Sha- 
farevich [186] once remarked that “a superficial glance at mathematics may give an 
impression that it is a result of separate individual efforts of many scientists scattered 
about in continents and in ages. However, the inner logic of its development reminds 
one much more of the work of a single intellect, developing its thought systemat- 
ically and consistently using the variety of human individualities only as a means. 
It resembles an orchestra performing a symphony composed by someone. A theme 
passes from one instrument to another, it is taken up by another and performed with 
irreproachable precision.” 

This is no doubt true and yet, the music reaches a crescendo in the hands of 
certain luminaries. One such luminary was Srinivasa Ramanujan. What is fascinat- 
ing about Ramanujan is that he was largely self-taught and emerged from extreme 
poverty to become one of the 20th century’s influential mathematicians. His story is 
a “rags to mathematical riches” story. In the cosmic symphony of mathematics, he 
played a major role. 

The music of Ramanujan emanates both from his life and his work. Born on 
22 December 1887 in humble and poor surroundings in the town of Erode situated 
in present day Tamil Nadu, India, Ramanujan cultivated his love for mathematics 
singlehandedly and in total isolation. As a child, he was quiet and often to himself. 
Those that knew him were impressed by his shining large eyes which were his most 
prominent features. He had a prodigious memory, and at school, he would entertain 
his friends by reciting the various declensions of Sanskrit roots and by repeating the 
value of the constant z to any number of decimal places. 

At the age of 12, he borrowed a book on trigonometry from an older student and 
completely mastered its contents. This book was Loney’s Plane Trigonometry pub- 
lished by Cambridge in 1894 and contains a great deal of information on summation 
of series, logarithms of complex numbers, calculation of 2 and Gregory’s series. 
This certainly goes far beyond any modern curriculum of trigonometry taught in our 
high schools today. But the book that influenced him the most was Carr’s A synopsis 
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of elementary results in pure and applied mathematics. This book is a compilation 
of some 6165 theorems, systematically arranged but with practically no proofs. It 
is not a remarkable book and was used by students of Carr for their preparation for 
the Mathematical Tripos, the entrance examination of Cambridge University. But 
Ramanujan has made the book famous in that he set about demonstrating to himself 
each of the assertions enunciated therein. To do this, he used a slate, jotting down 
the formula to be proved, erasing it with his elbow, jotting down some more formu- 
las that led to the proof, then erasing them again with his elbow and jotting down 
some more formulas. People used to speak about his bruised elbow, and we know 
how he got it. Thus he worked his way through the book. This experience influenced 
him profoundly, and his contact with this book marks the beginning of his explo- 
ration of the world of mathematics. Carr’s synopsis was therefore a great blessing. 
But unfortunately, Ramanujan took this synopsis as his model for writing, and his 
famous notebooks consisting of over 4000 formulas are written down in this style 
without proofs. The intermediate results, the links of the chain, have been erased by 
the elbow of Ramanujan, and his legacy is simply a set of discoveries, a melody of 
formulas. 

When we look through these formulas discovered by Ramanujan, it is like great 
music echoing through our consciousness and the music lingers. Each is pregnant 
with meaning and heralding further exploration. Perhaps it is not so unfortunate that 
Ramanujan had taken Carr as his model for writing. We all now have work to do. 
“When the kings are building, the carters have work to do.” 

Another thing that we learn from the early mathematical development of Ra- 
manujan, is the importance of problem solving in the primary grades. The math- 
ematician and educator, George Pélya, was right when he stated that mathematics 
cultivates logical and orderly thinking, and a precision for the expression of ideas. 
So Ramanujan mastered a large tract of college level mathematics simply through 
problem solving and working through Carr’s synopsis. 

A year later, in 1903, he secured a seat in the Government College in Kum- 
bakonam. However, his passionate absorption in mathematics led him to neglect his 
other subjects, and the inevitable happened. He failed the exams at the end of his first 
year. Four years later, he entered another college in Madras (now called Chennai) 
but met with the same fate at the end of his first year. 

In 1909, at the age of 22, he married Kumari Janaki, and with his new respon- 
sibility, it was necessary for him to secure a job. This he succeeded in doing in 
1912, when he became a clerk in the Madras Port Trust Office. There his duties 
were light, and he found time to devote to his mathematical research. Moreover, as 
luck would have it, the manager of the office, S.N. Aiyar, was a mathematician who 
took kindly to him and his discoveries. With Aiyar’s encouragement, Ramanujan 
communicated some of his results to several British mathematicians. (For a short 
biography of S.N. Aiyar and the role he played in Ramanujan’s life, we refer the 
reader to a recent article by Berndt [23].) His first three attempts produced little or 
no response. But in 1913, he wrote to G.H. Hardy at Trinity College, Cambridge. 
This was a turning point since Hardy was a renowned expert in analysis and number 
theory. 
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We should say here that number theory should not be confused with numerology. 
There is no mysticism attached to number theory. The only mystifying element is 
that there are beautiful formulas and there is a logical, mathematical order in the 
apparent chaotic universe. Number theory is the study of hidden mathematical pat- 
terns among numbers. It is called the queen of mathematics because the problems 
of number theory have given birth to the diverse disciplines of mathematics. Prob- 
lems are utilized as points of focus of concentration. In themselves, the problems 
are unimportant. But in the finding of their solution, new concepts arise, and new 
links and patterns are found with other concepts and disciplines of mathematics. It 
is the final mosaic that is the end in view and not the esoteric problem, which is used 
only as a means of motivation. 

So when Hardy received the letter, he found himself a little confounded and could 
not at first decide whether it was written by a crank or a genius. To the letter were 
attached about 120 theorems of which a representative sample is given by the fol- 
lowing 15: 
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The number of numbers between A and x which are either squares or sums 
of two squares is 


ah sas +80) (15) 


where K = 0.764..., and 6(x) is very small compared with the previous integral. 


These fifteen entries from Ramanujan’s letter to Hardy give a representative sam- 
ple of the formulas contained there. The first four belong to the theory of infinite se- 
ries. The next three are new definite integrals. Formulas (8) to (12) are in the theory 
of continued fractions. Formula (13) belongs to the theory of complex multiplication 
and singular moduli. Formula (14) is the first suggestion of Ramanujan’s knowledge 
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of the circle method (about which we say more in a later chapter). Finally, (15) be- 
longs to analytic number theory. 

Hardy took over two hours to analyse the letter to ascertain whether the author 
was a crank or a genius. Hardy’s reaction is expressed in his own words: “I should 
like you to begin by trying to reconstruct the immediate reactions of an ordinary 
professional mathematician who receives a letter like this from an unknown Indian 
clerk.” 

“The first question was whether I could recognise anything. I had proved things 
rather like (7) myself and seemed vaguely familiar with (8). Actually (8) is classical; 
it is a formula of Laplace first proved properly by Jacobi and (9) occurs in a paper 
published by Rogers in 1907.” 

So the conclusion was that Ramanujan had rediscovered all of these theorems 
amidst the impoverished mathematical background of his rustic surroundings. 

Hardy continues, “I thought, that as an expert in definite integrals, I could prob- 
ably prove (5) and (6) and did so, though with a good deal more trouble than I had 
expected. ... The series formulas (1)—(4) I found much more intriguing and it soon 
became obvious that Ramanujan must possess much more general theorems and 
was keeping a great deal up his sleeve. ... The formulas (10)-(13) are on a different 
level and obviously both difficult and deep. An expert in elliptic functions can see 
at once that (13) is derived somehow from the theory of complex multiplication, but 
(10)-(12) defeated me completely; I had never seen anything in the least like them 
before. ... The last two formulas stand apart. ... The function in (14) is a genuine 
approximation to the coefficient, though not at all close as Ramanujan imagined 
and Ramanujan’s false statement was one of the most fruitful he ever made, since it 
ended by leading us to all our joint work on partitions.” 

Indeed, (14) could only be derived by the circle method, a powerful technique 
developed later by Hardy and Ramanujan in their work on the partition function. 
The entry in the letter shows that Ramanujan had already thought about the circle 
method in India, before he had met Hardy. 

It seems that Hardy invited his colleague Littlewood and showed him the letter. 
They sat with it for three hours, from 9pm to midnight and finally concluded that 
this indeed was the work of a genius. Hardy [67] wrote later, “A single look at them 
is enough to show that they could only be written down by a mathematician of the 
highest class. They must be true, because if they were not true, no one would have 
had the imagination to invent them.” 

Soon thereafter, Hardy invited Ramanujan to come to Cambridge, which he felt 
could provide a better environment in which his mathematical genius could flourish. 
So Ramanujan sailed for England in March 1914. 

Against the background of the first world war, from 1914 to 1917, Hardy and 
Ramanujan spent time in wonderful mathematical collaboration. Hardy remarks that 
every day Ramanujan would show him about half a dozen new theorems. These 
three years saw prodigious mathematical activity by both Hardy and Ramanujan. In 
that period, Ramanujan wrote over 30 papers, in which were laid the foundations of 
three fundamental methods in number theory. 

The first of them concerns the circle method which, as noted earlier, already 
has its genesis in Eq. (14) of his first letter to Hardy. This method concerns an 
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ingenious idea for computing the integral of a function by studying its behaviour at 
rational points and sufficiently small neighborhoods. The method can be developed 
further and enables one to attack classical unsolved problems such as Goldbach’s 
conjecture, Waring’s problem and explicit formulas for the Fourier coefficients of 
modular forms and modular functions. These problems have defied solution for a 
long time. Due to Ramanujan’s early demise, this work was carried on by Hardy 
and Littlewood, and today it is called the circle method, or the Hardy—Littlewood 
method. In their paper on the partition function, Hardy and Ramanujan laid the 
groundwork of the method. But it was clear that this was a viable technique for 
attacking many age-old problems. Subsequently, this technique was developed and 
improved by Hardy and Littlewood and I.M. Vinogradov. 

The second major contribution was the normal order method which proved that 
almost all natural numbers have loglogn prime factors. These investigations were 
later developed into the beautiful probabilistic theory of numbers, starting with the 
work of Turan and culminating in the celebrated Erdés—Kac theorem. Afterwards, it 
was taken up by Kubilius, who infused finer improvements into the theory. We feel 
that there is a further direction for these investigations into the domain of Fourier 
coefficients of modular forms, and we discuss this theme in this monograph. This 
brings us to the third great contribution of Ramanujan. 

In 1916, he wrote a classic paper entitled “On certain arithmetical functions” in 
which he investigated the Fourier coefficients of various modular forms. There, he 
noticed patterns of congruences and made three significant conjectures concerning 
the behaviour of these Fourier coefficients. The most famous of these concerns the 
Ramanujan t-function. He conjectured that this function satisfies a multiplicative 
law and that its growth is controlled by a simple polynomial function. At the time, 
the conjecture did not have much meaning other than as an esoteric problem in 
analytic number theory. Indeed, regarding the t-function, Hardy [67] wrote, “We 
may seem to be straying into one of the backwaters of mathematics, but the genesis 
of t(n) as a coefficient in so fundamental a function compels us to treat it with 
respect.” 

Putting respect aside, the fact is that the function occupies a central place in the 
pantheon of coefficients of modular forms. So the legacy left by Ramanujan’s con- 
jecture is vast and deep. For it slowly transpired that these conjectures had intimate 
connection with profound aspects of number theory and algebraic geometry. 

The first significant step towards the conjectures of Ramanujan was taken by 
Mordell, who proved the multiplicative law, the first part of Ramanujan’s conjec- 
tures. But Mordell only treated the case of t and did not realize that it was pro- 
totypical of a spectrum of functions, each in its own right of central importance. 
This realization came twenty years later, in the work of the German mathematician 
Erich Hecke (who incidently was also born in the same year as Ramanujan), and the 
meaning of the multiplicative properties conjectured by Ramanujan was unravelled. 
This work of Hecke is considered a masterpiece of mathematics. 

But the connections to algebraic geometry were deeper still. Indeed, after the 
pioneering work of Artin and Hasse, Weil formulated in 1949 general conjectures 
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about solutions of systems of equations over finite fields. In the 1950s, it was sus- 
pected that the t-function of Ramanujan enumerates the number of these solutions 
for a certain system of equations (called a variety). Several Japanese mathemati- 
cians constructed the possible candidate, but there were technical problems related 
to compactification. These problems were resolved in 1974 by a Belgian mathemati- 
cian Pierre Deligne, who completely settled the conjectures of Ramanujan. In 1976, 
Deligne was awarded the Fields medal (which is the mathematical analogue of the 
Nobel prize) for this achievement. This is quite a legacy! 

The connection between Ramanujan’s conjecture and Weil’s conjecture could not 
be so easily foreseen. Both conjectures reflect elements of the master symphony that 
Shafarevich spoke about. Indeed, in his retrospective essay on number theory, André 
Weil wrote [201], “In 1947, in Chicago, I felt bored and depressed, and not knowing 
what to do, I started reading Gauss’s two memoirs on biquadratic residues, which 
I had never read before. The Gaussian integers occur in the second paper. The first 
one deals essentially with the number of solutions of equations ax* — by* = 1 in the 
prime field modulo p, and with the connection between these and certain Gaussian 
sums. ... then I noticed that similar principles can be applied to all equations of 
the form ax” + by” + cz’ +---=0, and that this implies the truth of the so-called 
“Riemann hypothesis” for all curves ax” + by” + cz” = 0 over finite fields, and 
also a “generalized Riemann hypothesis” for varieties in projective space with a 
“diagonal” equation >° a;x/' = 0. This led me in turn to conjectures about varieties 
over finite fields.” 

It was only a matter of time before several notable mathematicians realized that 
Ramanujan’s conjecture was really a “Riemann hypothesis” for a certain zeta func- 
tion of a variety over a finite field and that it would follow from Weil’s conjecture. 
This was the achievement of Pierre Deligne in 1974. 

No one could have foreseen such a cosmic connection. Yet, Weil is quite harsh 
on Hardy and in the same essay [201] wrote “Hardy’s remarkable comment is: “We 
seem to have drifted into one of the backwaters of mathematics.” To him it was just 
another inequality; he found it curious that anyone could get deeply interested in 
it. In fact, he becomes apologetic and explains that, in spite of the apparent lack of 
interest of this problem it might still have some features which made it not unworthy 
of Ramanujan’s attention.” 

The problem with Weil’s assessment of Hardy is that it is inaccurate. Hardy’s 
original quotation is that “We may seem to be straying into one of the backwaters 
of mathematics, but the genesis of t(7) as a coefficient in so fundamental a func- 
tion compels us to treat it with respect.’ The reader will note that Weil replaced 
“We may seem” with “We seem” which gives quite a twist to the meaning. More- 
over, Hardy does not say we will study it simply because Ramanujan had studied 
it, but rather that it is the coefficient of a fundamental function, namely A(z) in the 
theory of modular forms. If Hardy thought that the t-function was in the backwa- 
ters of mathematics, then it would have been unreasonable to give it as a doctoral 
thesis problem to one of his celebrated students, R.A. Rankin, who created what 
is now called Rankin’s method (also called the Rankin—Selberg method) an ana- 
logue of which was instrumental in the resolution of the Weil conjectures. So it is 
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quite presumptuous to make absolute pronouncements on the significance of various 
mathematical ideas since we never know how ideas are interconnected. And this is 
part of the legacy. 

But this legacy does not end here, and these investigations form just the tip of an 
iceberg. The central problem of number theory revolves around what is called the 
reciprocity law. The function of Ramanujan, and Fourier coefficients in general and 
their congruence properties reflect some aspects of the non-abelian reciprocity law. 
The theory of modular forms was further generalized by Jacquet and Langlands, and 
higher-dimensional versions of Ramanujan’s conjectures were formulated as part 
of the Langlands program. Some of these conjectures go beyond the Ramanujan 
conjecture. Finer distribution conjectures concerning the t-function inspired by the 
work of Sato and Tate in the theory of elliptic curves and first enunciated by Serre 
have now been proved. These results represent Himalayan peaks in the mathematical 
landscape of the 21st century. 

Returning to our narrative of Ramanujan, we find that at the end of his three 
years work in England, he left behind a tremendous mathematical legacy. In the 
summer of 1917, he fell ill with what was suspected to be tuberculosis. He never 
recovered. Nevertheless, he continued to work unabated. Hardy relates an interesting 
story during the time that Ramanujan was staying in the hospital in Putney. He went 
to visit him in a taxicab, and as he entered Ramanujan’s room, remarked that he 
had just ridden in a taxicab with number 1729 which seemed to be to him a rather 
dull number and hoped that this was not the indication of a bad omen. Ramanujan 
replied that on the contrary it is a very interesting number. It is the smallest number 
which can be expressed as the sum of two cubes in exactly two different ways: 


1729=1°+12? =10°+9°. 


This is not without significance. Recently, a beautiful theorem in the theory of el- 
liptic curves was proved involving this taxicab number. If for each k, there is a 
squarefree natural number 7 that can be expressed in at least k different ways as the 
sum of two cubes, then an outstanding problem in the theory of elliptic curves has 
an affirmative solution, namely ranks of elliptic curves over Q tend to infinity. 

Ramanujan had great intuition into what was important and central. His facile 
mind revealed an artistic symbiosis between intellect and inspiration. We are re- 
minded of a statement made by a great Indian sage, Swami Vivekananda, over a 
century ago. He said: “Just as the intellect is the instrument of knowledge, so is 
the heart the instrument of inspiration. In a lower state, the heart is a much weaker 
instrument than the intellect. ... Properly cultivated, the heart can be changed and 
will go beyond intellect; it will function through inspiration. Man will have to go 
beyond intellect in the end. The knowledge of man, his powers of perception, of 
reasoning and intellect and heart, all are busy churning this milk of the world. Out 
of long churning comes butter ... Men of [cultivated] heart get the butter and the 
buttermilk is left for the intellectual.” 

We do not know how Ramanujan discovered his theorems. On this point, Hardy 
[67] wrote, “It was his insight into algebraical formulas, transformations of infinite 
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series and so forth, that was most amazing. On this side most certainly I have never 
met his equal, and I can compare him only with Euler or Jacobi. He worked far 
more than the majority of modern mathematicians, by induction from numerical ex- 
amples; all his congruence properties of partitions, for example, were discovered in 
this way. But with his memory, his patience, and his power of calculation he com- 
bined a power of generalisation, a feeling for form, a capacity for rapid modification 
of his hypothesis, that were often really startling, and made him, in his own peculiar 
field, without a rival in his day.” 

These comments were made by Hardy in 1936 when he delivered his famous 
Harvard lectures on the work of Ramanujan. He began them with a rather sentimen- 
tal tone. “I have to help you,” he said, “to form some sort of reasoned estimate of 
the most romantic figure in the recent history of mathematics ... Ramanujan was, 
in a way, my discovery. I did not invent him—like other great men, he invented 
himself—but I was the first really competent person who had a chance to see some 
of his work, and I can still remember with satisfaction that I could recognize at once 
what a treasure I had found ... And my association with him is the one romantic 
incident in my life.” These are powerful feelings indeed describing one of the great 
collaborations of mathematics! 

We conclude this introduction by reflecting upon what we call the cultural legacy 
left behind by Ramanujan. We can do this no better than to relate the feelings ex- 
pressed by the Nobel laureate, Subramanian Chandrasekhar, at the Ramanujan Cen- 
tennial Conference in Urbana in 1987. He wrote [31]: “It must have been a day in 
April 1920, when I was not quite ten years old, when my mother told me of an 
item in the newspaper of the day that a famous Indian mathematician, Ramanujan 
by name, had died the preceding day; and she told me further that Ramanujan had 
gone to England some years earlier, had collaborated with some famous English 
mathematicians and that he had returned only very recently, and was well known 
internationally for what he had achieved. Though I had no idea at that time of what 
kind of a mathematician Ramanujan was, or indeed what scientific achievement 
meant, I can still recall the gladness I felt at the assurance that one brought up un- 
der circumstances similar to my own, could have achieved what I could not grasp. 
I am sure that others were equally gladdened. I hope that it is not hard for you to 
imagine what the example of Ramanujan could have provided for young men and 
women of those times, beginning to look at the world with increasingly different 
perceptions. 

“The fact that Ramanujan’s early years were spent in a scientifically sterile at- 
mosphere, that his life in India was not without hardships, that under circumstances 
that appeared to most Indians as nothing short of miraculous, he had gone to Cam- 
bridge, supported by eminent mathematicians and had returned to India with every 
assurance that he would be considered, in time, as one of the most original math- 
ematicians of the century—these facts were enough—more than enough—for as- 
piring young Indian students to break their bonds of intellectual confinement and 
perhaps soar the way that Ramanujan did. 

“It may be argued, perhaps with some justice, that this was a sentimental atti- 
tude: Ramanujan represents so extreme a fluctuation from the norm that his being 
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born an Indian must be considered to a large extent as accidental. But to the In- 
dians of the time, Ramanujan was not unique in the way we think of him today. 
He was one of others who had, during that same period, achieved, in their judge- 
ment, comparably in science and in other areas of human activity. Gandhi, Nehru, 
Rabindranath Tagore, J.C. Bose, C.V. Raman, M.N. Saha, S.N. Bose and a host of 
others, were in the forefront of the then fermenting scene.” 

In these words of Chandrasekhar, we see the legacy of Ramanujan. For the life 
of Chandrasekhar was equally full of hardships. Born in the same village surround- 
ings as Ramanujan, he went to study at Cambridge and there as a graduate student 
discovered the mathematical implications of the theory of relativity in the collapse 
of certain massive stars. These he predicted degenerate into black holes. The high 
priests of physics of that time rejected his calculations as meaningless. He had to 
wait for another thirty years before the theory came into the forefront of modern 
physics and finally in 1983, he was awarded the Nobel prize in physics as recog- 
nition of his work. The life of Subramanian Chandrasekhar itself reveals to some 
extent the grandeur of the legacy of Ramanujan. 

But a scientist belongs to no nation. Many of the mathematicians of distinction 
that we have met and talked with have all told us that Ramanujan directly or indi- 
rectly inspired their mathematical life. This is not surprising. For as we have seen, 
Ramanujan embodies that marvelous miracle of the human mind to frame concepts 
and to use formulas and symbols as tools of thought to probe deeper into the mys- 
teries of one’s own being. As long as the spirit of science is alive, his legacy will 
live, and the music will pass from one luminary to another. And all of us who think 
and work with mathematical ideas are participants in that wonderful symphony. 


Chapter 2 
The Ramanujan t-Function 


1 Introduction 


The 1916 memoir of Ramanujan, innocuously entitled “On certain arithmetic func- 
tions”, introduced the t-function. This is an integer-valued function on the natural 
numbers which, at first, manifested as part of an “error term” in counting the number 
of ways that a number could be written as a sum of 24 squares. However, Ramanu- 
jan realized that it was a function worthy of study in its own right. It would not be an 
overstatement to say that one of the significant themes of mathematics in the 20th 
century has emanated from this observation. 
The t-function is defined by the formal identity 


[ee 


Yo t@)q" =a] [-4")”. (1) 
n=1 


n=1 


It is in fact more than a formal identity. If we think of g as a complex number with 
|q| < 1, then taking the logarithm of the infinite product and expanding, we see that 
it is 


CO CO gm 
logg- 249°) a, 


n=1m=1 


Interchanging sums, we see that the double sum is 


ioe) 
1 q” 
72 es l—gq™’ 


m=1 


and this converges for |q| < 1. For a complex number z with 3(z) > 0, we see that 
q= eetlz 

satisfies |g| < 1, and so we may define a function A(z) by the right-hand side of (1). 

Moreover, we see that A(z) 4 0 as it is given as an absolutely convergent product 

of non-zero terms. 
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The function A(z) was known to previous authors. If we consider the Eisenstein 
series 


1 
Ex= J. ——z 
(m.n)#(0,0) (MZ +7) 


where the sum ranges over pairs of integers (m,n) which are not both zero, then 
there is the identity 


1 
A() = 179g (E4@” — E6(z)”). 


Dedekind had studied A(z) (and a 24th root of it known as the 7-function). The 
n-function occurs in the transformation properties of the Dedekind sums, and the 
A-function occurs in the theory of the moduli of elliptic curves. It also occurs in 
the limit formula of Kronecker. However, Ramanujan was the first to realize that the 
coefficients of the g-expansion give rise to an interesting arithmetic sequence. 


2 The t-Function and Partitions 


Writing integers as sums of elements of a distinguished subset is a theme that oc- 
cupied Ramanujan in many of his works. As we said, the t-function itself arises 
in the problem of representing an integer as a sum of 24 squares. Ramanujan also 
gave considerable attention to the partition problem, namely the number of ways of 
writing a positive integer as a sum of positive integers. If we denote by p(n) the 
number of such representations of a positive integer n, then we see that the first few 
values are given by p(1) = 1, p(2) = 2, p(3) =3, p(4) =S, p(5) =7, p(6) = 11 
and so on. If we consider the generating function 


Y= pina" 
n=1 


then it is easily seen to be equal to 


(oe) 


T]@-4")". 


m=1 


This bears some superficial resemblance to (1). Indeed, such series had been studied 
classically by Euler, Jacobi and others. However, it was Ramanujan who began to 
observe arithmetical properties of the coefficients, such as congruences. For exam- 
ple, he observed that 


p(5n+4)=0 (mod 5) 
and 


p(In+5)=0 (mod 7). 
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He continued this line of thought to the t-function itself. We know now that such 
congruences are based on deep aspects of the theory of modular forms. 

While the sequence {p(7)} and {t()} share properties such as congruence rela- 
tions, they are, however, very different in terms of their growth properties. Ramanu- 
jan’s circle method (described in another chapter) shows that p(n) grows exponen- 
tially as a function of n, while as we shall see in this chapter, t (7) has a polynomial 
growth inn. 


3 Related Generating Functions 


Euler had studied the function 


[[G-a") (2) 
and proved that it is equal to 


xe [ytgGrt/2. 


neZ 


Numbers of the form n(3n + 1)/2 are called pentagonal. This and related q- 
expansion identities can be expressed in terms of the number of partitions of an 
integer into other integers satisfying various constraints. For example, Euler’s iden- 
tity can be interpreted as giving an expression for the number of partitions of a 
number into an even number of unequal parts minus the number of partitions of the 
same number into an odd number of unequal parts. 

We can ask for which integers m there is an n € Z such that 


m= 5nn +1)? (3) 


We need to have 1 + 24m to be a perfect square, say r*. Moreover, we need —1 +r 
divisible by 6, and in particular, 6 should not divide r. When these conditions are 
satisfied, we have 


: 1 

n= AC 4 
In particular, given m, there are at most two values of n satisfying (3). Thus, when 
(2) is written as a power series in qg, the coefficients are bounded. Moreover, the 
number of m < x for which (3) has a solution is < ./2x/3. In particular, most of 
the coefficients are zero, and the series is “lacunary”’. We can say a little more about 
which coefficients are nonzero. Indeed, for 1 + 8m =r? to be satisfied, we need r 
odd, and 


r). 


r+l=a, r—l=b 
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for some factorization 8m = ab. This implies that r = 5(a +b) anda=b+2.In 
particular, both a and b are even, andr = b+ 1 and 8m = b(b+ 2). 
Jacobi studied the third power of (2) and proved that 


[o,e) [o,e) 


T][a = ay = S"Qn 4 Hger. (4) 


n=1 n=0 


Numbers of the form n(n + 1)/2 are called triangular. Again, it is seen that this 
is a lacunary series, in the sense that the set of n for which the coefficient of g” is 
nonzero has density zero. Note that we have 


oo 24 
> t(n)q" = a( > (-1yngantem2) 
n=1 meZ 


and 


lee) ioe) 8 
dota" =4 ( dim + bgnmnr) 
n=1 m=0 


In particular, one can derive the following formulas for t (7): 
25, 
@@—Drn= (: =e Gm ie 1))e(n = Gm + ») 
1<|m|<an 
where 


i 1 
an = gl +d + 24n)?); 


(n—1)t(n) = x (215771 (Om 4+ D(n =i mm ))e(n- (n+ ») 


l<m<by 


where 
1 1 
= 5 (cl +8n)2 —1). 


The first of these is due to Lehmer, and the second to Ramanujan. However, these 
formulas do not seem to be useful in giving an expression for t(m) in terms of 
elementary functions. They cannot be viewed as ‘closed-form’ expressions since 
the range of summation is a function of the argument. However, they do suffice to 
show that t(7) has at most polynomial growth in n. 


4 Values of the t-Function 


It is easy to compute the first few values: t(1) = 1, t(2) = —24, 1(3) = 252. The 
following table is copied from [110]. 
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n T(n) n T(n) n T(n) 
1 1 11 534,612 21 —4,219,488 
2 —24 12 —370,944 22 — 12,830,688 
3 252 13 —577,738 23 18,643,272 
4 —1472 14 401,856 24 21,288,960 
5 4830 15 1,217,160 25 —25,499,225 
6 —6048 16 987,136 26 13,865,712 
7 —16744 17 —6,905,934 27 —73,279,080 
8 84480 18 2,727,432 28 24,647,168 
9 —113,643 19 10,661,420 29 128,406,630 

10 —115,920 20 —7,109,760 30 —29,211,840 


Looking at this table and others that give more values, many natural questions 
come to mind. Firstly, we see that the numbers are growing fairly rapidly. However, 
the growth is not exponential since it was shown by Ramanujan that 


|z(n)| <n'. 
He conjectured that 
[r()| <d(nyn'? 


where d(n) denotes the number of positive divisors of n. This is known as the Ra- 
manujan conjecture (actually Hardy called it the Ramanujan hypothesis), and it is 
now a theorem as we shall explain in another chapter. The only proof of this relies 
on “reducing” it to a special case of the Weil conjectures and appealing to the proof 
of these conjectures by Deligne. 

It is a classical result that d(n) = O(n‘) for any € > 0, and so a weaker version 
of the Ramanujan conjecture is that for any € > 0, 


T(n) Ke nil/2re 


where the subscript indicates that the implied constant may depend on e€. 
We might also notice from the table a fact that Ramanujan stated as a conjecture, 
namely that the t-function is multiplicative: 


T(mn) = t(m)t(n) (5) 


for positive integers m and n that are relatively prime. Moreover, Ramanujan ob- 
served that for a fixed prime p, the values of t(p”) satisfy a second-order recur- 
rence relation: for m > 1, we have 


t(p™*") = c(p)t(p") — p''t(p™""). (6) 
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Pp T(p) Pp T(p) Dp T(p) 
2 —24 31 —52,843,168 73 1,463,791,322 
3 252 37 —182,213,314 79 38, 116,845,680 
5 4830 41 308, 120,442 83 —29,335,099,668 
7 —16744 43 —17,125,708 89 —24,992,917,110 
mt 534,612 47 2,687,48,496 97 75,013,568, 546 
13 —577,738 53 —1,596,055,698 101 81,742,959, 102 
17 —6,905,934 59 —5,189,203,740 103 —225,755, 128,648 
19 10,661,420 61 6,956,478, 662 107 90,241,258,356 
23 18,643,272 67 —15,481,826,884 109 73,482,676, 310 
29 128,406,630 71 9,791,485,272 113 —85, 146,862,638 


Both of these properties were proved by Mordell within a year of the publication of 
Ramanujan’s paper. These relations imply that all of the values of the t-function can 
be determined once they are known at prime arguments. Above is a table of t(p) 
for primes p < 113 (the first 30 primes). 

Consider again the values of the t-function on powers of a prime. Denote by a, 
and 6, the complex numbers which are roots of the equation 


T* —t(p)T + p!! =0. 
Then t(p) =a) + Bp and ap By = p'!. Let us write 
op = p''/2¢i%p, 


Here 6, is a complex number, and the Ramanujan conjecture is the assertion that in 
fact @, is real. In any case, we deduce that 


t(p) =2p''/? cos(6,). 


This implies that 
t(p)? =4p''(1 —sin’(6,)). 


We see from this that we cannot have sin(6,) = 0 as t(p) is an integer. Moreover, 
relation (6) shows that 


t(p?) =t(p)? — p'! =a, + apy + Bo. 


More generally, by induction, we can show that 
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Equivalently, 
sin(a + 1)0 
t(p’) ape pe (7) 


sin 6p 
5 Parity of the t-Function 


A few more calculations will show that t(p) seems to be even for all primes p. This 
is in fact true and can be proved as follows. We have the congruence 


(=o) Seg" y -tiod’9): 


Now, by a q-series identity of Jacobi, we have 


ie) : ioe) A 
[[@ +q°") = Nog +4 


n=l m=0 
Thus, we deduce that 
lee) lee) > lee) 3 
> t(n)q" =q ‘: qin 4m = a g2"t)" (mod 2). 
n=1 m=0 m=0 


In particular, t (1) is odd if and only if n = (2m + 1)”, in other words, if and only if 
n is an odd square. In particular, t(p) is even for every prime p. 


6 Congruences Satisfied by the t-Function 


The result of the previous paragraph is that 
T(p) =0 (mod 2) 


for all primes p. There are many other congruence relations discovered by Ramanu- 
jan. Here is a partial list: 


(1) t(p) =1 + p? (mod 29) 
(2) t(p) =1+ p (mod 3) 

(3) t(p) =p + p'” (mod 5*) 
(4) t(p) = p+ p* (mod 7) 
(5) t(p) =1+ p!! (mod 691) 


More congruences were discovered later by other authors including Bambah, 
Chowla, Gandhi, Swinnerton-Dyer and Wilton. 

As mentioned briefly earlier, Ramanujan was also the first to find congruences 
satisfied by the partition function. Some of these are 
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(1) p(5m +4) =0 (mod 5) 
(2) p(7m +5) =0 (mod 7) 
(3) p(11m + 6) =0 (mod 11) 


In Chap. 7, we will indicate how such congruences can be proved. 


7 Vanishing of the t-Function 


Of the many problems that are open with respect to the t-function, there is a con- 
jecture of Lehmer [111] that asserts that 


t(p) #0 


where p is a prime. Equivalently, 
tT(n) £0 


for any n > 1. In fact, we have the following elementary result of Lehmer. 


Proposition 7.1 Let no denote the least value of n for which t(n) = 0 (if it exists). 
Then no is prime. 


Proof The multiplicativity of the t-function (5) shows that ng is a prime power, say 
no = p*. Suppose that a > | (in other words, t(p) 4 0). Then from (7) we deduce 
that 


sin(a + 1)6, =0 
and so 

6, =ka/(a+ 1) (8) 
for some integer k. The number 


4(cos6p)* = t(p)?/p'! 


is rational. On the other hand, by (8), it is an algebraic integer. Thus, it is in fact 
an integer, say m, and it is also clear that m > 0. Again, by (8), @p is real, and so 
|cos@,| < 1 and som < 4. Then 


t(p)? = mp'! 


and as T(p) is an integer, m must be divisible by p. Putting all of these constraints 
together, we see that p =m = 2 or p =m =3. This implies that 


t(2)=+2° or 1(3)=+3°. 
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But neither of these can hold as we see from the tables above: 1(2) = —24 and 
tT (3) = 252. This completes the proof. 


We also have the following curious result. 


Proposition 7.2 Suppose that the set {n : t(n) = 0} has density zero. Then it is, in 
fact, empty. 


Proof Suppose that t(n) = 0 for some n. Then by the previous result, the least such 
n is a prime number, p (say). It follows that if n = mp with p{m, then t(n) =0. 
Thus, the set 


{n :T(n) =0} 


has density > (p — 1)/p, contradicting our hypothesis. 


Using the congruences satisfied by the t-function, it is possible to show that no 
(if it exists) must be quite large. Indeed, Lehmer himself observed that if t(p) = 0, 
then the congruences imply that 


p?=-1 (mod2°). 
Since p 4 2, we also have that 

p'°=1 (mod2°). 
These two congruences together imply that 

p’=1 (mod 2°) 

from which it follows that 

p=-1 (mod2°). 
Similarly, one finds p =—1 (mod 2° -3- 57 - 691). This immediately implies that 

p>2-3-5*-691 — 1 = 1,658,399. 


In fact, one can do much better, and it is currently known that t(p) 4 0 for p < 10. 
Of course. Lehmer’s conjecture is that it should be nonzero for all p. Recently, 
this conjecture has been related to the irrationality of coefficients of certain mock 
modular forms. 

There is an even stronger conjecture than that of Lehmer, which has been sug- 
gested by Atkin and Serre. For every € > 0, they ask whether 


[z(p)| >< p?-*? 
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Nothing is known about this conjecture. In terms of the angle 6, introduced earlier, 
the above conjecture is equivalent to the assertion that 


4 


6 
a) 


I+e° 


1 
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If we consider t (1) rather than t(p), it is possible to prove a kind of lower bound. 
It is shown in [134] that there is an effectively computable absolute constant c > 0 
such that for all positive integers n for which t (7) is odd, we have the lower bound 


|c(n)| => (logn)°. 


The condition on the parity of t(m) ensures that it is not divisible by the first power 
of any prime. Thus, the set of n for which the result applies is the so-called squarefull 
numbers (that is, numbers for which every prime divisor occurs to at least the second 
power). 

It is also interesting to ask for lower bounds that hold infinitely often. Hardy 
showed that 

t(n)>n i 

holds infinitely often. The best result in this regard is to due to R. Murty [128], who 
showed that there is an absolute and effective constant c > 0 such that 


|c(n)| sal 


exp{clogn/loglogn}. 

This result is essentially best possible since we know that 
d(n) < exp{c’ logn/loglogn} 

and by Ramanujan’s conjecture (Deligne’s theorem), we have 


|c(n)| < d(nyn''/. 


With recent developments on the Sato—Tate conjecture (see Chaps. 10 and 12), these 
results are valid for any c < log?2. 


8 Divisibility of t(p) by p 


The many congruences satisfied by the t-function all have a fixed modulus and vary- 
ing argument. A different kind of congruence is the one in the title of this section, 
namely whether 


t(p)=0 (mod p). (9) 


Indeed, this does occur, and the first example is p = 2 as we have already seen 
above. The only primes known to satisfy (9) are p = 2, 3,5, 7, 2411, 7758337633. 
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(This last prime was discovered quite recently [116].) It is not known whether there 
are infinitely many such primes. Neither is it known that the complement is infinite! 
In this regard, the situation is similar to that of Wieferich primes. 

Heuristic reasoning would suggest that 


#{p <x: t(p) =0 (mod p)} ~ loglogx. (10) 


However, the function loglogx grows so slowly that it is computationally difficult 
to distinguish it from a constant. (For example, loglog 10!> is approximately 3.54.) 
If (10) were true, we would expect that 


1 


lo 
T(p)=0 (mod p) &P 


converges. However, we do not even know whether 
aS 
T(p)=0 (mod p) 


converges. 
Suppose that p is a prime for which (9) holds. Then from the relation 


t(p**!) = c(p)t(p%) — p''t(p*") 
we see that for all a > 1, 
t(p*)=0 (mod p*). 
We also note that if t(p) =0 (mod p) and n = pm where (p,m) = 1, then 
(t(n),n) #1. (11) 


If the set of primes for which (9) holds has positive density, then it follows by an 
elementary sieve argument that the set of n for which (11) holds has density 1. This 
latter statement can be proved unconditionally. In fact, one knows that 


#{n eco aes (x(n), n) = 1} <« x/logloglogx. 


It is interesting to note that if there are infinitely many primes p such that t(p) = 
0, then for any value of k, there exist infinitely many values of n such that t(n) = 
tT(n+1) = t(n+2) =---=t(n+k) = 0. This is an easy exercise using the Chinese 
remainder theorem. 


9 Lehmer’s Conjecture and Harmonic Weak Maass Forms 


Finally, we report on some recent work of Bruinier, Ono and Rhoades [27] that 
opens another line of investigation for Lehmer’s conjecture. We give a brief (and 
slightly technical) explanation of this new development. 
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After the study of classical modular forms, it is natural to study the space of 
weakly holomorphic modular forms which are just meromorphic modular forms 
whose singularities may only occur at the cusps. These spaces are contained in the 
larger space of harmonic weak Maass forms which need not be holomorphic but 
are annihilated by a second-order differential operator. At the cusps, we allow for 
exponential growth of a special type. To be precise, let 


A e + Z + ik : +i : 
= —s+-= i —+i—)}. 
5 y ax? dy? Y\ Ox dy 
Let x be a Dirichlet character modulo N. A harmonic weak Maass form of weight k 


on Io(N) with Nebentypus x is a smooth function on the upper half-plane satisfying 
the following three conditions: 


b 
(1) (=) = x(d)(cz +.d)* f(z) for au( a) € Id(N): 


(2) Ax f =0; 

(3) there is a polynomial Py = >>, <9 cy (n)q” € C[q7!] such that for some fixed 
€ > 0, f(z)— Pr(z) = Oe *”) as y tends to infinity, with analogous conditions 
at the other cusps. The polynomial Py is called the principal part of f at the 
corresponding cusp. 


This vector space of harmonic weak Maass forms is denoted Hy (J0(N), x). One 
can show that every weight 2 — k harmonic weak Maass form f(z) has a Fourier 
expansion at each cusp of the following form: 


f= Dl cha" t+ docF@r(k = 1,4eInly)q", 


n>—0o n<O0 


where I"(a, x) is the incomplete Gamma function given by 


[o,@) 
I'(a,x) = (eet adh. 
x 
The differential operator 


Ey = 2iy 


sl 


has the property that 
4: Hox (O(N), x) > Se(0(N), X)- 


A straightforward calculation shows that 


Ge f) = 4m)" Dc nn 1g". 


n=1 
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In other words, the coefficients c; (n) are really coefficients of classical cusp forms. 
Now let g € S;(Io(N), x) be a normalized newform. We say that a harmonic weak 
Maass form f is good for g if the following conditions are satisfied: 


(1) The principal part of f at the cusp oo belongs to F,[q7'], where Fy is the 
number field obtained by adjoining the Fourier coefficients of g to Q; 

(2) The principal parts of f at the other cusps of [g(V) are constant; 

(3) &_-x(f) = g/(g, g), where (-,-) denotes the Petersson inner product. 


The main theorem of [27] is that if 


ee) 


g= Yo cg(n)q” € Sk(To(N), X) 


n=1 


is a normalized newform and f € Ho_;(Jo(N), x) is good for g, then for any p 
coprime to N for which cg(p) = 0, we have ct(n) algebraic for any n with ord, (7) 
odd. In other words, the vanishing of the Fourier coefficients of a Hecke eigenform 
implies the algebraicity of the Fourier coefficients of the corresponding harmonic 
weak Maass form. The authors in [27] discuss the case for Lehmer’s conjecture in 
this context. 


Chapter 3 
Ramanujan’s Conjecture and £-Adic 
Representations 


Let us recall that Ramanujan’s conjecture asserts that for all primes p, we have 
|z(p)| < 2p". 


The key to understanding this is the modular nature of A(z). There is an action of 
the modular group 


SL2(Z) = {(¢ 1) 4sbie.d 2, ad —be= 1 
on complex numbers with positive imaginary part, the so-called upper half-plane 
H = {zeC:3(z) > 0}. 
The action is given by fractional linear transformations 
a b az+b 
(: 4) a cz+d 


Under this action, A has the following transformation property: 


A(yz) = (cz +d)!" A(z). 


This, and the holomorphy of A on 1 as well as at the boundary of H, make A a 
modular form. The weight of a modular form is determined by the power of the 
automorphy factor 


J(y,2) =(ez+d) 


that appears in the transformation property. Thus A is a modular form of weight 12. 
The g-expansion 


0° 
A@= iting", quer 
n=1 
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is then seen as the Fourier expansion of A (at the cusp ico). The fact that the Fourier 
expansion has no constant term (that is, the coefficient of g° is 0) designates A as a 
cusp form. 

Petersson generalized Ramanujan’s conjecture to other modular forms which are 
cusp forms. More precisely, he conjectured that for a cusp form of weight k and 
having a Fourier expansion 


[ee 


f= Sang” 


n=1 


at the cusp ioo, the coefficients satisfy the inequality 
Ja(p)| Ke p&-Virt 


for any € > 0. 


1 The Weil Conjectures 


Let X be a smooth projective variety over a finite field F of q elements. Let us fix 
an algebraic closure F of F and for each integer n > 1, denote by F,, the unique 
extension of F of degree n contained in F. The function 


CO T? 
£(X, n=en( ben / | (12) 


n=1 


is called the zeta function of X. For example, if X =P! is the projective line, then 
|X (F,)| =q” + 1, and an easy calculation shows that 


1 
1 = 
ee (l—T)—gqT) 


More generally, if X = P”, writing projective space as a stratified union of affine 
spaces of smaller dimension, we find that 
1 
Ca DdSqNa@sqhs-daguy: 
Weil made three conjectures about this class of zeta functions. First, he conjec- 


tured that it is a rational function of JT. More precisely, he conjectured that there are 
polynomials P;(X, T) € Z[T] for 0 <i <2dimX so that 


PE) = 


2dim X 


in|, Paper (13) 
i=0 


with Po(T) —= ad = T) and Podim x (T) = él _ quan), 
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He also conjectured that the P; satisfy a functional equation relating P;(X, 7) 
to Podimx—i(X, (qr T“)) where a; and b; are integers determined by the topology 
of X. Finally, and perhaps most strikingly, he conjectured that the polynomials P; 
can be factored as 


2b; 
P(T)= | [Gd -04,;T) (14) 
i=) 


and the a;,; are complex numbers satisfying 
joi) = i? (15) 


for all relevant values of 7. This third conjecture is called the Riemann hypothesis 
for X. 

To understand this terminology, we have to rely on another (equivalent) definition 
of the zeta function. Let us set (see [182]) for a complex number s, 


1 -1 
6(X,s) = (1 ~ -) 
I] N(P) 


where the product is over the closed points P of X, and N(P) is the order of the 
residue field of P. This is the geometric analogue of the Riemann zeta function. We 
assume that )i(s) is sufficiently large so that this product converges. Then, we have 
the relation 


(G)=29") 


where q is the cardinality of F. In this case, (15) and (14) show that the zeros 
of ¢(X,s) satisfy H(s) = 1/2,3/2,..., dim X — 5 and the poles satisfy Xt(s) = 
0,1,...,dim X. In the sense that (15) allows us to locate the zeros and poles of 
¢(X,5) on certain vertical lines, it can be said to be an analogue of the Riemann 
hypothesis. 

The original motivation of Weil in proposing his conjectures was to address the 
problem of counting the number of points on varieties over finite fields. If we loga- 
rithmically differentiate relations (12), (13) and (14), we deduce that 


2dim X 2b; 
Ix@)|= D> OC vie, 
i=0 j=l 
and using (15), this becomes 


|x@,)| ST hain & + Ox(q-”)} 


where the constant implicit in the error term depends on X. 
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2 The Case of Elliptic Curves 


Weil himself proved his conjectures in the case that X is a curve (that is dim X = 1). 
In this case, the expectation is that 


Z(X,T)= oa (16) 
(—-T)d—-4T) 


where P(T) € Z[T], and the degree of P(T) should be 2g where g is the genus of 
the curve X. 

The case of elliptic curves is of particular interest. Here we have g = 1, and 
so P(T) is a quadratic polynomial. Understanding this case provides insight into 
Weil’s intuition that there should be a relationship between the topology of X and 
the number of solutions of various congruences (number of points on X). 

Over the complex numbers, an elliptic curve E is a one-dimensional torus 


E(C)=C/L. (17) 


Here L Cc C is a lattice, in other words, L ~ Z @ Z. We see that for any 1 <n € Z, 
we have 


1 

E[n)=—-L/L ~ Z/nZ@Z/nZ 
n 

where E[n] denotes the set of points R in E(C) satisfying nR = 0. The subgroups 

E[n] form a directed system under the partial ordering on Z given by divisibility, 

with natural morphisms 


E[n|] —> E[m] 
whenever m|n given by 
n 
Rv —R. 
m 
The inverse limit under these maps is then seen to be 


L@Z~|[L@Z). 
£ 


In particular, for each prime @, the ¢-primary component is denoted 7;(E) and is 
called the Tate module of EL. We see that as abelian groups 


Te(E)~L@Ze~Z. GZ. 


For elliptic curves over more general fields F’, one no longer has the uniformiza- 
tion given by (17). However, the groups E[n] still make sense, as does T;(E). More- 
over, if @ is different from the characteristic of F’, we still have 


T(E) =~ Ze © Ze 
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as abelian groups. But if F is not algebraically closed, there is a new structure that 
emerges. If we choose an algebraic closure F of F, then the E[n] and T;(E) are 
not only abelian groups, but modules for the natural action of Gal(F'/F). Thus, we 
have a representation 


Gal(F/F) —> Autz, (Te(E)) ~ GL2(Zp). 


In particular, if F = F is a finite field of q elements, we have the Frobenius 
automorphism o € Gal(F'/F’) which is given by 


o(x)=x!. 


For (€, q) = 1, we thus get a matrix in GL2(Z¢). If we denote by P¢(T) the charac- 
teristic polynomial of this matrix, it is a fact that 


Po(T) = P(T), (18) 


where P(T) is given by (16). In particular, Pe(T) has coefficients in Z (instead of 
just Z,) and is independent of €. 

If we now consider an elliptic curve E over a number field F’, the above con- 
structions still make sense, and we can speak of the Tate module 7;(E). We again 
have the family of Galois representations 


Gal(F/F) —> Autz, (Ty(E)) ~ GLz(Z) (19) 


giving the action of the Galois group on points on E of ¢-power order. 

If v is a prime of F’, we may consider the reduction E,, of E modulo v. Naively, 
this just means that we reduce the coefficients of an equation defining E modulo v. 
Apart from a finite number of primes v of F,, we have that EF, is again an elliptic 
curve, and it is defined over the residue field F, which is a field of cardinality equal 
to the norm Nv of v. 

As in the previous paragraph, we may consider (now for ¢ not dividing Nv), the 
Tate module T;(E,). On the other hand, we may consider the original T;(E) but now 
viewing it as a module restricted to just the decomposition group at v. It is a theorem 
that under the assumption that E,, is an elliptic curve, the inertia subgroup at v acts 
trivially on T;(E), and thus it may also be viewed as a module for Gal(F/ F). (The 
converse of this theorem is also true. Thus, if the inertia subgroup at v acts trivially 
on 7;(E), then the reduction E, is again an elliptic curve.) 

Now we have two modules for Gal(F/ F), namely T(E) and T;(£,). It turns out 
that they are isomorphic. In particular, we may speak of the characteristic polyno- 
mial of Frobenius at v acting on 7;¢(£), and it is the same Pe(T) = P(T) that we 
found in (18). 

Considering again the global Galois representation (19), we now have a family of 
£-adic representations which have the property that for any prime v of the number 
field F at which the inertia subgroup acts trivially, the characteristic polynomial of 
Frobenius at v makes sense, and if we denote it by Pe(T, v), then it is an element of 
ZT] and is independent of ¢. 


30 3  Ramanujan’s Conjecture and ¢-Adic Representations 


Moreover, P,(T, v) is the numerator of the zeta function of an elliptic curve E, 
over a finite field. It is thus of the form 


Pc(T,v) =1—a,T +(Nv)T? 
where a, € Z, and applying the Riemann hypothesis for Ey, it follows that 
lay] < 2(Nv)?. (20) 


We may also use this information to construct a formal product 


[]G. = avevey? + vay 9)8 


Vv 


The product only ranges over primes v for which EF, is an elliptic curve. Using 
estimate (20), we see that the formal product actually converges absolutely for 
Re(s) > 3/2. Up to finitely many factors (namely those at primes v for which Fy, 
is not an elliptic curve), this function is called the L-function associated to E and 
denoted L(E, s). By analogy with the definition in the first section, the zeta function 
of E is given by 


C(E,s)=(s)o(L — s)/L(E, s). 


3 £-Adic Representations 


More generally, let us consider a number field F and a family My of finite- 
dimensional Ze modules which carry an action of Gal(F'/F’). We say that the rep- 
resentation (or equivalently, the module M¢) 


Gal(F /F) —> Autz, (Mz) 


is unramified at a prime v of F if the inertia subgroup at v acts trivially. We say that 
the family {Me} is unramified at v if every element Me of the family is unramified 
at v. Finally, we say that {M,} is a compatible family of £-adic representations if 
there is a finite set S of primes of F such that for all primes v ¢ S, the family is 
unramified at v and for any prime £ that is relatively prime to 


Nv [[ y~. 


wes 


the characteristic polynomial P¢(7, v) is independent of @ and has coefficients in Z. 

There are many such families that arise from algebraic geometry. The example of 
elliptic curves was seen in the previous section. Now the question is whether there 
is such a family with F = Q and unramified at all primes, and with the property that 
for p+ £, 


P.(T, p)=1—t(p)T + p!'T?. 
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If there were such a representation, then there would be the hope that the Ramanujan 
conjecture could be put into a context where the Weil conjectures would apply. In 


that case, it might even be possible to deduce the Ramanujan conjecture from the 
Weil conjectures. 


4 Elliptic Curves and Modular Forms 


An example may illustrate what we can expect. Writing 


=~ 24 = 2 = 22 
a] [0-a') =a] 0-4") T]G-4’) 
n=1 n=1 n=1 
we see that 
lee) [o@) lee) 
So cinq" =a] [(l-a") [i -a'")? mod 11). 
n=1 n=1 n=1 


Using the notation of the chapter on the Ramanujan t function, the right-hand side 
of the congruence can be written as n(z)*n(11z)? (where as usual, we are writing 
gq = e*"'®), Expanding the product, and writing 


n(z)?n(11z)? =) ang” 


n=1 
we have the following table of values: 
n an n an n an 
1 1 6 2 11 —-l 
2 —2 7 —2 12 —2 
3 -l 8 0 13 4 
4 2 9 =2 14 
5 1 10 —2 15 -1 


The function f(z) = n(z)2n(1 1z)? is an example of a modular form. It is a holo- 
morphic function defined for complex numbers z with the property that 3(z) > 0. 
Moreover, it satisfies a functional equation with respect to certain fractional linear 
transformation. Namely, if we denote 


ran) ={(2 f) tebe d eZ, ad —be=1,N\e| 
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then 
(==) = (cz +d) F(2) 
cztd) , 
Since 
d (az+b _9 
sce = d 
(SS) ere) 


it follows that f(z) dz is invariant under the action of [(N). 
Now we define a map 


Py: I0(N) > C 


as follows. Fix a point z9 € C with 3(zo) > 0. Set 


VZ0 
Py (y) =I f(z) dz. 
20 


The integral on the right does not depend on zo. Indeed, we have 


YZ1 20 Y20 YZ1 
Z1 Z1 Z0 Y20 


By achange of variables and the invariance of f(z) dz under the action of y, we see 
that 


YZ1 Z1 Z0 
f@dz= / (Gage / fds. 
Z0 Zi 


YZ0 
Thus, 
YZ 


YZ0 
fides [ f(z) dz. 
Z0 


Z1 


Note also that as the upper half-plane is simply connected, the value of the integral 
does not depend on the path chosen from Zo to y Zo. 
The map @ is in fact a homomorphism since for 71, y2 € [o(N), we have 


Y2%0 V1 220 
em) = | fac [ f (2) dz. 
Z0 220 


Again by a change of variables and using the invariance of f(z) dz under y2, we see 
that this is equal to 


Pe (Mi¥2) = Pf (M1) + Pe (2). 


The elements in J(V) can be classified into three types depending on their trace. 


Thus an element 
_ fa b 
soma eae 
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is called elliptic if |trace(y)| = |a + d| < 2, hyperbolic if |trace(y)| > 2 and 
parabolic of | trace(y)| = 2. 


Proposition 4.1 [fy € Ip(N) is elliptic or parabolic, then © f(y) = 0. 


Proof If y is elliptic, then it has finite order. Since @ (1) = 0, and @f is a ho- 
momorphism, it follows that ®¢(y) = 0 for any element of finite order. If y is 
parabolic, then a + d = +2. Note that —I € Ij(N). Moreover, as it is of order 2, 
by the comment above, ® ¢(—/) = 0. Hence, multiplying y by —/ if necessary, we 
may suppose that a + d = 2. Suppose first that c = 0. Then 


p54) 


Zot+r 
&;(y) = / F(2dz. 
Z0 


Then 


As f is acusp form, its constant term at the cusp ioo vanishes, and so 
zotl 
/ f(@@dz=0. 
20 


Iterating this, we deduce that ® ¢ (vy) = 0. Now suppose that c ¥ 0. As y is parabolic, 
it fixes the rational point (a — 1)/c. Choose a matrix 6 € SL2(Z) such that 6 maps 
this point to ico. Then 


vi =BbyB' 


v2 Bo! Bz0 
/ faces f F(z) dz. 
z 20 


0 


fixes ico. We have 


The right-hand side is equal to 


yi BZ0 ; Bzo+r i 
[ (FIB- Jorac= f (FIB dz. 


And this is zero as f is a cusp form and the constant term of f|B~! vanishes. This 
proves the result. 
Let us set 


Ar ={Pr(y): y ETON). 


It is a theorem of Eichler and Shimura that if f has integer Fourier coefficients, then 
Af is a lattice in C. (In general, if the Fourier coefficients lie in a number field, 
we would consider a higher-dimensional version of Ay which would give rise to a 
lattice in C% for some g > 1.) 
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Using this lattice, one may then consider the torus C/A which is an elliptic 
curve. At first, it is a curve defined over the complex numbers. However, the theory 
of Eichler and Shimura shows that it in fact has a model defined over the rationals Q. 

As an example, consider [(11). It is generated by the three elements 


ih “a 8 1 9 1 
r= (¢ a nu=(5, a vo=(35 5). 


Since T is parabolic, ® ¢(T) = 0. If we set w; = ®f (V4) and w2 = Pf (Vo), then 
Af is the lattice spanned by w and wp. In fact, the corresponding elliptic curve is 
given by the equation 


y? ty =x? — x? — 10x — 20 


and is denoted X0(11). 
As discussed earlier, we have a compatible family of ¢-adic Galois representa- 
tions 


Gal(Q/Q) —> Autz, (Te(Xo(11))) ~ GLa(Ze) 


which are unramified outside 11. Moreover, recall that we have set 


f@= So ang”. 


n=1 


Then for any prime p # 11, the characteristic polynomial of the Frobenius automor- 
phism at p is 


l—apT + pT”. 


If we “reduce” the | 1-adic representation modulo 11, that is, if we compose it with 
the natural map 


GL2(Z11) —> GL2(Z/11Z) 
then the characteristic polynomial of the Frobenius automorphism at p is 


1—(a,) (mod 11))T+(p (mod 11))T?. 


Now by the congruence given at the start of this section, it follows that this is the 
same as 


1—(t(p) (mod 11))T+(p (mod 11))T?. 


It is a theorem of Shimura that the map 
Gal(Q/Q) —> GLy(Z/11Z) 


is surjective. It follows from an application of the Chebotarev density theorem that 
for any residue classes a, b € Z/11Z with b ¥$ 0, we can find infinitely many primes 
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(in fact a positive density set of primes) p for which 
T(p)=a_ (mod 11) 


and 
p=b (mod 11). 


Thus, there cannot exist any special congruences for the t function modulo 11. 

We see from this that the theory of €-adic representations gives us a tool to “un- 
derstand” congruences satisfied by the t function. However, the construction de- 
scribed in this section only gives us a “mod €”-representation and that too, only for 
one value of £, namely € = 11. The question now becomes where one should look 
for the general family of representations. 

Since the motivation in this chapter is Ramanujan’s conjecture on the size of 
T(p), we have to look for these representations in algebraic geometry. 


5 Geometric Realization of Modular Forms of Higher Weight 


We begin by giving a brief description of how modular forms of higher weight occur 
in the cohomology of certain algebraic varieties. Let J” denote a subgroup of finite 
index of SL2(Z). It acts by fractional linear transformations on the upper half-plane: 


a b pees +b 

ce d} ~~ cz+d’ 
Consider now the product H x C of the upper half-plane and the complex line. 
There is an action of I” » Z on this product by 


(¢ 2) cna (th rarer 


We consider the quotient (7 x Z*)\(H x C). This is in fact an open algebraic sur- 
face, and there is a compactification Br of it called the ‘universal elliptic curve’. 
The terminology comes from the fact that it is a fibre variety over the modular curve. 
The modular curve itself parameterizes elliptic curves with additional structure, and 
the fibre at a point of the modular curve ‘is’ the elliptic curve corresponding to that 
point. 

Now, let us consider holomorphic 2-forms on Br. Let f be a holomorphic mod- 
ular form of weight 3 for I and consider the 2-form f(z)dz A dw on H x C. It is 
invariant under the action of x Z?. Indeed, an element 


((e 4): ) 
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az+b az+b w+mz+n 
d A d{ ————— 
(=) (=) ( cz+d ) 


(cz +d)? f (2)(cz +d) dz A (cz +d)! dw = f(z)dzAdw. 


acts on f by 


which is 


Because of this invariance, f(z) dz A dw gives rise to a holomorphic 2-form on the 
quotient and on the surface Br. In fact, it is true that all holomorphic 2-forms on Br 
can be obtained in this way in the sense that there is an isomorphism of the space of 
cusp forms on I” of weight 3 and holomorphic 2-forms on Br. 

One can go further and analyze the entire Hodge decomposition of H?(Br, C). 
We have 


H?(Br,C) = H°(Br, 27) ® H°(Br, 22) @ H""!. 


The above discussion identifies the first two pieces. The third piece, H!-! turns out 
to consist entirely of rational (and hence, by Lefschetz, algebraic) classes. This is a 
result of Shioda [191]. 

In general, in the construction of Br, if we replace C with C‘~?, one gets what 
is known as a Sato variety Be This is a variety of dimension k — 1, fibred over 
the modular curve as before. The fibre at a given point of the modular curve is the 
(k — 2)nd power of ‘the’ elliptic curve corresponding to that point. Again, one can 
show that the space of holomorphic (k — 1) forms on pe is isomorphic to the 
complex vector space of weight k cusp forms for I”. In general, we do not know 
much about the field of definition of Be. However, if I” is a congruence subgroup, 
then we know from the general theory of Shimura and others that it is defined over 
a number field. 

Now let us restrict our attention to /” a congruence subgroup, say [o(NV). All of 
the discussion above also has an ¢-adic analogue. In this case, the complex coho- 
mology referred to above is replaced with €-adic cohomology. This theory is a vast 
generalization of the theory of the Tate module discussed earlier. In fact, the first 
£-adic homology of an elliptic curve is its €-adic Tate module. 

In general, one considers the £-adic cohomology of Be (or rather, a model of 
this variety defined over the rationals), and one finds that 7 : a (BY) contains a sub- 
space S¢ (say) of dimension 2 dim S; (J). Moreover, Ht a. (BY) has two important 
actions on it. Firstly, it is a module for the action of Gal(Q/Q), and secondly, it 
is a module for the Hecke algebra, an algebra of operators referred to in an earlier 
chapter. The subspace S¢ is invariant under both of these actions. Diagonalizing the 
Hecke action, we get a decomposition 


Se= Of Se(f) 


into submodules, where each S¢(f) is a rank two module over K ¢[Gal(Q/Q)], and 
Ky is acertain number field (the field generated by the Fourier coefficients of f). 
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These are the Galois representations attached to modular forms of higher weight 

whose existence had been conjectured by Serre and constructed by Deligne [38]. 
Moreover, the Galois action is unramified outside primes dividing €N, and if 

pt €N, then the characteristic polynomial of the Frobenius conjugacy class at p is 


| [G -4r(y°T + p17’) 


the product ranging over embeddings 0 : Kr Q,. If we factor the quadratic 
polynomials above, we get complex numbers zr (p)° such that 


ap(p)? =z (p)? +74 (p)? 


and 


m ¢(p)° 1 (py? = pe. 


It is the reduction of the variety Be that one applies the Weil conjectures to. 


By the above remark about the characteristic polynomial of Frobenius, we see that 
when Be is seen as a variety over F,, in the action of Frobenius on H, as ae x 
F,,), amongst the eigenvalues of Frobenius will be the {7 (p)} and by the Weil 
conjectures, these numbers are all of complex absolute value p“~)/?. This is, in 
brief, how the Ramanujan conjecture is deduced from the Weil conjectures. The 


reader may find the survey article [91] also helpful. 


Chapter 4 
The Ramanujan Conjecture from GL(2) to 
GL(n) 


1 The Ramanujan Conjectures 


In retrospect, one sees a progression of ideas from Ramanujan’s work on the t- 
function to Hecke’s theory of modular forms, and then moving to the representation 
theoretic perspective, we see this progression linking itself to theories of Harish- 
Chandra and Langlands. Ramanujan’s conjecture now sits as a special case of a 
more comprehensive conjecture in the Langlands program. The survey article [141] 
provides an excellent introduction to this chain of ideas, and we recommend this 
to the reader. But the origins of the chain go back to the 1916 paper of Ramanujan 
which acted as a catalyst for this development. 
In his epic paper of 1916, Ramanujan [162] considered the function 


CO 
A(z) =q T[@ _ q’)*, ga emt, 


n=1 


As we have seen, expanding the right-hand side as a power series in g defines the 
celebrated Ramanujan t-function: 


[o,e) [o.e} 


a] [GQ -4a")* => cq". 


n=1 ASl 
In his paper, Ramanujan made three conjectures concerning T (1): 


(1) t(mn) = t(m)t(n), for (m,n) = 1, 
(2) for p prime and a > 1, t(p*t!) = t(p)t(p®) — p!!t(p*"}), 
(3) |t(p)| < 2p". 


The first two conjectures were proved a year later by Mordell [126]. His proof 
was reproduced by Hardy in his 1936 lectures on Ramanujan’s work delivered at 
Harvard University. Almost twenty years after Ramanujan’s paper, Hecke [77] pub- 
lished the conceptual framework from which (1) and (2) emerge as special cases 
of a larger theory, now called Hecke theory. Despite heroic efforts to settle conjec- 
ture (3) by such luminaries as Hardy, Kloosterman, Rankin and Selberg, we had to 
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wait until 1974 when Deligne proved it as a consequence of his proof of the Weil 
conjectures. 

We outline the proofs of these conjectures. As mentioned in the introduction to 
Chap. 3, the essential property of A(z) is that it is a modular form of weight 12 for 
the full modular group SL2(Z), which is the group of 2 x 2 matrices with integer 
entries and determinant 1. This means that for z in the upper half-plane h, we have 


az+b =. 12 a b 
a(S?) <4 A(z) v(2 {) € Sla(@). (21) 


The set of holomorphic functions f : h — C vanishing at infinity and satisfying the 
modular relation 


b 
(4) -cs+atve v(é 4) € Sta@) 


forms a finite-dimensional vector space S; over C. Since —I € SL2(Z), the modular 
relation shows that S; = 0 for k odd. For k even, one can show without too much 
difficulty [183] that dim $2 = 0 and dim S; = [k/12] if k #2 (mod 12). Thus, $2 
is one-dimensional and spanned by A. In fact, 12 is the first value of k for which 
Sr #0. 

Hecke discovered a family of linear transformations T,, of the finite-dimensional 
vector space S;. These are now called Hecke operators, though they are nascent 
in Mordell’s work. To describe these, it is convenient to introduce the following 
notation. 

Let GL} (R) denote the group of 2 x 2 matrices with real entries and positive 
determinant. It is easy to see that GL} (R) acts on § via the usual fractional linear 
transformations: 


az+b C b 
= y= 


= L} (R). 
YZ cz+d’ 4) GL ® 


Given f € Sx, we define the “slash operator” via 


(fly)(z) = (det yy)‘ (cz +d) f (v2). 


Thus, f € Sy if and only if f|y = f for all y € SL2(Z). 
Hecke observed that SL(Z) acts on the set M,, of matrices with integer entries 
and determinant n. Indeed, with = SL2(Z), we have 


TI My = MI = Mn. 


Consequently, we can decompose M,, as a disjoint union of J’-orbits. In fact, a 
simple application of the division algorithm shows that 


M.= |] ur(3 ?: 


ad=n,d>0 b=0 
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Thus, the finite set of matrices 


é ay ad=n,d>0, 0<b<d-l, 


comprises a complete set of orbit representatives for this action. Hecke defines 


d-1 
Tr(fyant Le “ (22) 


ad=n,d>0 b=0 


Since I’M, = M, I" = My, it is easily seen that for any y € I”, we have 


In(Mly = Tn(f). 


Noting that 7,,(f) also vanishes at infinity whenever f does, shows that T,(f) € Sx. 
In other words, for each natural number n, we have a linear transformation T,, : 
Sx — Sz. These transformations are called Hecke operators. Since 


1 1 
f 1) € Sta(@), 


every f € S, has a Fourier expansion 
[o,@) 
Fo) =D apomernine, 
m=1 


The expansion starts from | since the function is holomorphic and vanishes at infin- 
ity. From (22) one can routinely compute the Fourier expansion of T,,(f). Indeed, 


[o,@) 
b = im(az 
f| ( 4) (z) = nkl2q k > jeje en, 


m=1 


Inserting this into the inner sum of (22) and applying the elementary fact 


d-1 
Oy ai = 0, 
b=0 


unless d|m, in which case it is d, we obtain that 


[o,@) 

ipjan” > a a Aimer) 
ad=n,d>0 m=l,d|m 

Writing m = dr, we can re-write the inner sum as 


ad . 
Ap (dryer, 


r=1 
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Collecting the terms with ra = m, we see that 


iiij=an ~( aplaryat) erin: 


m=1 ‘ad=n,ar=m 


The coefficient can be re-written as 


> Af (nm/a’) (n/a)'-* 


a\n,m 


so that 


[e.e) 


L<p)\= a > Ay (nmja?)ak) enim, (23) 


m=1 ‘a|n,m 


In the case of f = A, we see immediately that 7,,(A) = cy, A since S}2 is one- 
dimensional. The constant c, is computed easily by comparing the first Fourier co- 
efficient of both sides. We obtain 


T,(A) =t(n)A. (24) 


If (m,n) = 1, then computing the mth Fourier coefficient of the left-hand side of 
(24) using (23) shows that it is t(@mn). Comparing this with the mth Fourier co- 
efficient of the right-hand side of (24), we obtain t(mn) = t(m)t(n) whenever 
(m,n) = 1, which is Ramanujan’s first conjecture. 

The second conjecture also follows directly from (23). Indeed, for p prime, 


Tp(A) = tT(p)A 
so that by comparing the p“th Fourier coefficient of both sides leads to 
t(p**!) +4 p tip?) = t(p)t(p*), 


which is Ramanujan’s second conjecture. 
The significance of Ramanujan’s second conjecture is that the power series 


% t(p*)x* 


is a rational function and can be written as 


(1—r(p)x + px?) 1. 

With regard to the third conjecture, there is a simple argument that leads to the 
estimate t(p) = O(p®). Though this was first proved by Hardy, his proof was more 
elaborate. We need only make two observations. The first is that a fundamental 
domain D for the action of SL2(Z) on the upper half plane can be taken to be the 
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standard one, namely, |z| > 1, |9%(z)| < 1/2. In other words, every element in the 
upper half-plane is SL2(Z)-equivalent to some element of D, and no two interior 
elements of D are SL2(Z)-equivalent. The second observation is that 


yPIA@|, z=xtiy 


is invariant under the action of SL2(Z) as is easily checked using the modular trans- 
formation for A. From the Fourier expansion of A(z) we see that it has exponential 
decay at infinity, and therefore it is bounded (say by K) in the fundamental domain 
D. Since 


[o,@) 
Ate tiy)= prime mers 


n=1 


we can calculate the nth Fourier coefficient by 


1/2 
t(nje 7" = / A(x + iy)e 7" dx. 
-1/2 


Since |A(z)| < K/y°, we deduce 
|c(n) |e?" < K/y°. 


Choosing y = 1/n gives the estimate t(n) = O(n®). 
With Ramanujan, we can consider the Dirichlet series 


which by our modest estimate of t(m) converges absolutely for #t(s) > 7. By Ra- 
manujan’s conjecture (1) and (2), we can write this as an Euler product: 


The modular relation (21) implies that 
A(-l/iy) =y" Atiy) 


and this can be used to derive an analytic continuation and functional equation for 
La(s). Apparently, this was first done by Wilton in 1928. This is a major theme of 
Hecke theory developed systematically, again, much later in the 1930s. 

Seated in the comfort of the modern era, it is easy for us to be surprised that some 
obvious developments did not take place earlier or with greater rapidity. We must 
however understand that much of this work seems to be against the background of 
two world wars and there was no one giving us the “bigger picture”. In his book, 
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Lang [106] wrote, “Partly because of Hitler and the war, which almost annihilated 
the German school of mathematics, and partly because of the great success of certain 
algebraic methods of Artin, Hasse, and Deuring, modular forms and functions were 
to a large extent ignored by most mathematicians for about thirty years after the 
1930s. Eichler, Maass, Petersson and Rankin were the main exceptions.” 

This was the case with even Hecke theory. Much more is true about the Hecke 
operators. A straightforward calculation using (23) shows that 7, Tin = Tin Ty for all 
m,n. It was only in 1939 that Hecke’s student, Hans Petersson introduced an inner 
product on S; by defining 


—_—dxd 
a= | *ror@ =>. 
D y 
He was able to show that 
(Tn(f), 8) = (fF. Tn()) (25) 


so that the 7,,s are a commutative family of Hermitian operators. Already in the 
proof of (25), it was clear that one needed to move to “higher levels” and not just 
confine oneself to SL2(Z). Petersson’s theorem allowed him to prove that S; has a 
basis of eigenfunctions for all of the Hecke operators 7, since (by a familiar theorem 
of linear algebra) the 7;,s could be simultaneously diagonalized. 

The theory generalizing the first two conjectures of Ramanujan to other congru- 
ence subgroups of SL2(Z) was slow in maturing. It was only in 1970, when Atkin 
and Lehner [12] published their fundamental paper that the first part of the theory 
reached completion. 

As for the third conjecture, this had several strands of development. The first 
strand was largely a development of analytic methods of attack beginning with 
Kloosterman [98], who, in 1927, applied the circle method and at the same time 
devised a new way of handling the major arcs, thereby discovering the celebrated 
Kloosterman sum (about which more will be said below). Kloosterman was able to 
show that t(p) = O(p®—!/8) by his method. 

In the early 1930s, Salie and Davenport (independently) refined earlier meth- 
ods and were able to show t(p) = O(p® 1/6), In the late 1930s, Hardy’s doctoral 
student, R.A. Rankin [169] developed a new method using Eisenstein series that 
enabled one to get better estimates on t(n). This new method came to be known as 
Rankin’s method (or sometimes, the Rankin-Selberg method, since Selberg arrived 
at it independently). This method (which we will discuss later in this chapter) gives 
t(p) = O(p®!/*). 

Yet another analytic method using the theory of Poincaré series was developed 
by Selberg. This method allows one to get an explicit formula for t (7) that involves 
Kloosterman sums and Bessel functions. Selberg [180] showed that this explicit 
formula leads to the improved estimate O(p®!/*). 

Selberg’s work led to many further and fundamental developments, most notable 
of which is the Selberg trace formula. In a special case, this becomes the Eichler— 
Selberg trace formula which gives a formula for the trace of the nth Hecke operator 
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acting on the space of cusp forms in terms of class numbers of imaginary quadratic 
fields. In particular, it relates the Ramanujan t-function to such sums of class num- 
bers. 

Though these methods did not settle Ramanujan’s third conjecture, they initiated 
new and hidden connections with other parts of mathematics. The most spectacular 
is the 1965 paper of Selberg [180] where he discusses the spectral theory of the 
Laplace operator and connects it with estimates for t(7). In the same paper, he 
formulates the now celebrated Selberg eigenvalue conjecture of which we shall say 
more later. 

At the end of his paper, Selberg [180] wrote, “It seems most likely that little new 
may be expected along these lines in the near future, and that a proof of, for in- 
stance, the Ramanujan conjecture is more likely to result from future developments 
in algebraic geometry.” 

Indeed, if one looks at the Ramanujan conjecture for general Hecke eigenforms, 
then in 1954, Eichler, Shimura and Igusa solved it for the case k = 2 by noting that 
if we consider 


M(N) = \v = (: ) € SL;(Z), c =0 (mod | 


then [o(N)\b, suitably compactified, has the structure of a Riemann surface and 
consequently, can be identified as the C-locus of a curve. This curve, denoted 
Xo(N), is called the modular curve of level N and is defined over the integers. 
One can therefore consider the zeta function of this curve over the finite field of p 
elements. The “Riemann hypothesis” for this zeta function turns out to be equivalent 
to the analogue of the Ramanujan conjecture for weight 2 Hecke eigenforms. 

In the early 1960s, it was strongly believed that the Ramanujan conjecture would 
be proved similarly, not from the “Riemann hypothesis” for curves, but rather from 
considering higher-dimensional varieties and as a consequence of the Weil conjec- 
tures. In fact, Eichler, Shimura, Kuga and Ihara constructed a variety that should 
have worked but did not quite work since it was not compact. Deligne showed how 
to compactify their variety and finally, resolved Ramanujan’s conjecture. This was 
the geometric attack on Ramanujan’s conjecture, described briefly in the previous 
chapter. 

Yet another analytic method connected to representation theory was outlined by 
Langlands [108]. We sketch this method below. But first, we have to discuss another 
aspect of the Ramanujan conjecture, and this is the analogue in the context of Maass 
wave forms. 


2 Maass Forms of Weight Zero 


The Ramanujan conjecture and its generalization by Petersson opened the way to a 
better understanding of what it meant in the bigger picture. Surely, Ramanujan knew 
that this held in greater generality since in his 1916 paper, he formulated it for other 
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modular forms as well. This laid the foundation for a conceptual enlightenment. In 
the 1960s, Jacquet and Langlands [86] reworked Hecke’s theory for general auto- 
morphic forms on GL(2). Indeed, once the general notion of an automorphic form on 
GL(n) was precisely formulated, it soon became clear that classical modular forms 
comprise only half of the theory of automorphic forms on GL(2). There was an- 
other side to the story, and this is the theory of Maass wave forms. The Ramanujan— 
Petersson conjecture for holomorphic Hecke eigenforms could be extended to cover 
Maass wave forms also, and more generally to automorphic forms on GL(n). This 
became known as the Ramanujan conjecture for GL(n) and in this generality repre- 
sents a major unsolved problem in the general theory today. Its solution would have 
significant consequences in number theory and related areas. 

The conceptual insight gained by reformulating the Ramanujan conjecture for 
GL(n) in this way led to a larger understanding of another celebrated conjecture, 
namely, the Selberg eigenvalue conjecture, originally formulated in the context of 
Maass wave forms. In fact, in the adelic formulation, the Ramanujan conjecture 
“at the infinite prime” becomes the Selberg eigenvalue conjecture. We review this 
theory briefly below. 

Indeed, if we consider modular forms without the holomorphy condition but in- 
sist that our function is an eigenfunction of the non-Euclidean Laplacian 


af 0-0" 
A=- —+-— 
: Ge =) 
we atrive at the notion of real analytic forms. We may write such a function as a 
function of the variables x, y, and since f(z + 1) = f(z), we have 


fey > Gwe. 


Suppose that Af = Af. This gives us a condition on the coefficients ay(f, y), 
namely that they satisfy 
2d 29-19 
Gin FY) = (A — Aminly "anf »)- 


This is a Bessel-type differential equation with two solutions. But we only take the 
one which gives rise to a convergent series above. One can renormalize and show 
that 


f(x,y) =ao( fy’ +ag(f yy! + Yo an(f)/VKir (2 |nly)err™* 
n#~0 


where 


1* Gey 
Kir) = Al eT IE ae 
—0o 
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with A = 1/4 + r?. Maass [117] proved that the series 


an(f) 
DE In|s 


n#~0 


extends to a meromorphic function for all s € C analytic everywhere except possibly 
at s = 0 and s = 1, and satisfies a functional equation. 

The analog of the Ramanujan conjecture in this context is that for any € > 0, 
an(f) = O(n‘). The Selberg conjecture is that A > 1/4, or equivalently, r is real 
and not purely imaginary. 

In his 1970 paper, Langlands [108] indicated how the Ramanujan conjecture 
would follow if a family of L-functions discussed below admit an analytic continu- 
ation to a fixed half-plane. At the same time, he interpreted the Selberg conjecture 
as the Ramanujan conjecture “at infinity” and thus put both conjectures on an equal 
conceptual footing. (This adelic viewpoint had roots in earlier work of Satake.) We 
will outline this argument of Langlands below. 

At the moment, the best general estimate for GL(2) are due to Kim and Shahidi 
[97], who showed that a, = O(n7/®), and Kim and Sarnak [95], who showed that 
A > 0.238. 


3 Upper Bound for Fourier Coefficients and Eigenvalue 
Estimates 


In [108], Langlands also introduced symmetric power L-functions and conjectured 
that they extend to analytic functions for all values of s. He outlined how this con- 
jecture would lead to a proof of the extended Ramanujan conjecture and the Sel- 
berg eigenvalue conjecture. The general theory of symmetric power L-functions 
has come to play a central role in number theory and representation theory of auto- 
morphic forms. We sketch the argument of Langlands below. 

We begin with the elementary observation in analytic number theory 


1 2+i00 
et = sf I'(s)x* ds 
271 J2~i00 


which is easily demonstrated by contour integration and Stirling’s formula. Hence, 


= 1 
nem = [roses as 
2ni (2) 
n=1 
where 


(QS) qn 
n=1 


48 4 The Ramanujan Conjecture from GL(2) to GL(n) 


Now suppose that a, > 0 and f(s) is absolutely convergent for Xi(s) > 1+. 
Moving the line of integration to Xt(s) = 1 + € gives 


Co 
age: = O(x!**). 
n=1 
Thus, for any individual term in the sum, we have 
ane nl* = Oa); 


Choosing x =n, we deduce that a, = O(n'**), 

If we let 2 be an automorphic representation on GL(2) with local parame- 
ters, Wp», By, then one can give an intuitive description of the symmetric power L- 
functions attached to z in the following way. 

Consider, with r,, denoting the m-th symmetric power of z, 


} 
Ly(s):= L(s,m, m®=TTT (2 ) 


Pp j=l 


where we are ignoring the finitely many Euler factors that need to be modified cor- 
responding to the ramified factors. 
Consider the L-function 


L,%,1m@Fm)= |] Lts,x,7%). 
k<2m,k odd 


The proof of this identity is equivalent to the trigonometric identity 


sin36 — sin50 sin(2m — 1)@ sinmé \? 


sind sin 0 sin 0 sind 


which is easily proved by induction and left as an exercise for the reader. 

Thus, the series L(s, 7,7 ® 7m) is a Dirichlet series with non-negative co- 
efficients. If we now suppose that for each m > 1, L(s,7,rm) is analytic for 
R(s) > 1+, then its pth coefficient (for p prime) is O(p!**) by the argument 
given above. But the pth coefficient is easily calculated to be 


an 2 

m—j ef 
oe Bp 
j=l 


Moreover, |@»8p| = 1, so that if the Ramanujan conjecture is false, one of these has 
absolute value greater than 1. Without any loss of generality, suppose it is ~». Then, 
in the above summation, an dominates the sum, so we deduce 
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Taking 2mth roots, we obtain 


ap = O (per oen). 
and letting m tend to infinity, we obtain a») = O(1), which is the Ramanujan con- 
jecture. 

Using the fact that each of the L-functions L(s, 7, rj) satisfies a functional equa- 
tion, one can improve the estimate using a classical result of Chandrasekharan and 
Narasimhan [32]. This result says that if a, > 0 and f(s) = bDeeat, an/n* is conver- 
gent in some half-plane, has analytic continuation for all s except for a pole at s = 1 
of order k and satisfies a functional equation of the form 


QO A(s) f(s) =wQ! “AU —s)f(1—s) 
where Q > 0 and 
A(s) =| [ P@is + Bi) 


then 


A. pia oad ey 
> An = X Pr_\(logx) + O(x 24+T log x) 


n<x 


where A= >> ;0;. Taking differences, we deduce that 
an = O(n2a41 logk~! n). 


In [129], this result is stated with a typo on page 525. (On lines 3 and 7 of [129], 
(2A — 1)(2A + 1) should be (2A — 1)/(2A + 1) in both instances.) Hardy [67] was 
aware of this argument, at least for m = 2, as is indicated in the tenth of his twelve 
lectures on Ramanujan’s work. 

A similar reasoning can be applied to obtain bounds in the Selberg eigenvalue 
conjecture. If x corresponds to a Maass form with eigenvalue A, then the Gamma 
factors in the functional equation of L(s, 7, rm) will have the following shape: 


m 


=} 1 
revmrm)=T1P(* 5 i), Aj =i(m — 2j)r, re 
j=0 


Using the conjectured analyticity of L,,(s) for all m and this information on the 
I’-factor for L,(s) now gives the Selberg eigenvalue conjecture. 


4 Eisenstein Series 


It is interesting to note that R.A. Rankin was a doctoral student of G.H. Hardy and 
had taken up Ramanujan’s conjecture for his doctoral thesis in 1933. The thesis was 
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a fundamental advance since it set the stage to what was to become the celebrated 
Rankin—Selberg method. This method involved an unfolding trick that generalized 
to the GL(n) setting as was noted by Jacquet, Piatetski-Shapiro and Shalika [87]. 

We present this method in its simplest setting. The earliest example of a Maass 
form is given by the Eisenstein series 


Ss 


eo 3 
|mz +n|?s ‘ 


1 
_ _-s 
E(z,s)=ma Ps); ) 
(m,n)A(0,0) 


This series converges for Si(s) > 1, and we clearly have 
E(yz,s) = E(z,5) 
for all y € SL2(Z). In addition, it is easily verified that 
AE(z,s)=s(1—s)E(z, 5) 


so that E(z, s) is a weight zero Maass form with eigenvalue s(1 — s). Since E(z, s) 
is periodic with period 1, we can derive its Fourier series: 


CO 


iG) =>) ane” 


r=—0o 


and 
1 . 
a,(y,s) = i E(@@etiy ae 7" dx. 
0 


We do the obvious. We insert the series expansion for F(z, s) into the integral and 
apply Fubini’s theorem. First, the contribution to E(z, s) from m = 0 is 


ma *I'(s)y*f(2s). 


This is part of ap(y, 5) but not all of ag as we shall see below. Now suppose m 4 0. 
Since (m,n) and (—m, —n) give the same summand in E(z,s5), we may suppose 
that m > 0. Thus, 


Cc CO 1 
ar(y, s)=n*T(s)ys S> / [(mx +n)? + m2y2] Se 2" dy, 
m=|n=—oo 0 
If we put n = qm-+d with 0 < d <m, the sum becomes 


& [o,@) 
», bs i [ (mx + d) + ome le as. 
[o,@) 


m=1d mod m* — 
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We change the variable x = u — d/m to get 


is (2 + yP) eran ~ oe) du. 


“20 d mod m 
The innermost sum is zero unless m|r in which case it is m. Thus, the sum becomes 
[o,@) 
= SP i 
Som! a (u? +y’) e 2miru du. 
m|r ee 


If r = 0, we get 


mz *I'(s)y*C(2s — vf. (u? + y)° du 


which is equal to 
na */a0(s —1/2)y'~5¢(25 — 1). 


Thus, the constant term (on applying the functional equation for ¢(s)) is equal to 
ao(y.s) =m P(s)S(2s)y* +21 ~ s)E(2 — 2s)yt™*. 
If r 4 0, then we get 


ay (y, 8) = ros (Ir) /¥ Ks—1/2(2zIrly) 
where 


1-2: 
Ce) = Sm 


m\r 


One can show that a,(y,s) =a;(y, 1 — s) and r’o_a.(r) = r-*o05(r), from 
which the functional equation is easily deduced. 


5 Eisenstein Series and Non-vanishing of ¢(s) on t(s) =1 


We want to indicate a proof of the non-vanishing of ¢(s) on ‘%(s) = 1 which uses 
the theory of Eisenstein series and as a consequence does not use the Euler product 
of ¢(s) as most conventional proofs do. The idea was used by Jacquet and Shalika 
[89] in their general result about the non-vanishing on {i(s) = 1 of automorphic 
L-functions associated with GL,. 

Recall that 


Ss 


a 
|mz +n|?s ‘ 


1 
__-s 
E(z,s)=a Ps); y 
(m,n)A(0,0) 
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Notice that we may also write this as 


1 
E(,8) =m (8) 56Qs) Yo My) 


VET o\L 


where I’, is the stabilizer of the cusp at infinity. 
We have already indicated that 


E(z,s)=a°I(s)¢(2s)y’ +25! F( — s)C(2 — 2s)y!-$ 


+ oo ro —25(Irl) VY Ks—1/2(27 |r ly)er7™ 
rA0 


where oy(n) = ain d” and 


dt 


K;(y) = sf —yt+17!)/2,s 
Se 2 0 7 t 


One can prove directly that K;(y) = K_s(y) and r’o_25(r) = r*o5(r), which 
allows us to deduce the functional equation of E(z, s) from its Fourier expansion. 

This result lies at the heart of the Langlands—Shahidi method of analytic contin- 
uation of Eisenstein series. It is also at the core of the Rankin—Selberg method of 
analytic continuation discussed in the next section. 

Now suppose that ¢(1 + if9) = 0 for some fg real. Then, ¢(1 — ito) = 0 also. We 
put s = (1+ %)/2 in E(z, 5). Then, the constant term vanishes, and we get a Maass 
cusp form: 


[oe oe) 
; ; a {4 dt 
E(z, d+ ito)/2) =4,/y y riot (r) cos@nrs) [ gee) F=f)?" 


r=l 


Using standard estimates for the integral, one can show that the sum is O(e~) for 
some c > 0. As the constant term of E(z, (1 + ito)/2) is zero, we have a genuine 
Maass cusp form on our hands. 

In particular, 


1 
/ E(x +iy, 1 +ito)/2) dx =0. 
0 


2 


Multiplying this equation by y*~ and integrating from 0 to ov, we get 


lee) 1 
/ / E(x +iy, (1 +ito)/2)y** dx dy =0. 
0 JO 
Now we use the basic idea that 


LJ vW\#) = 10, 11 x [0, cx], 


VET o\F 
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usually referred to as the “unfolding” of the domain of integration. Thus, 


dxd 
/ E(z, (1 + ito)/2)3(@)* ~S* =0. 
yW\A) y 


VET o\P 


As E(yz, 5) = E(z, 5), we may change variables and get: 


dxd 
o= / E(z, (1 + ito)/2)S(y2)5 - 
yelo\F I\H y 

dxd 
= E(z, (1 + ito)/2)E(@, 8) 
I\H y 


valid for all s € C. 
From the definition of E(z, 5) (or its Fourier expansion) we see that 


E(z,5) = E(z,5). 
Therefore, putting s = (1 — ifo)/2, we get from the penultimate equation: 
: 2dxd 
0= / |E(z, (1 + ito)/2)| = 
I\H 


Thus, the integrand is identically zero. That is, we have proved that ¢(1 + ito) =0 
implies that 


E(z, (1 + ito)/2) =0. 


We now show that this is a contradiction. We do this by showing that some Fourier 
coefficient of F(z, (1 + ito) /2) is non-zero. That is, we need to check 


- du 
—mry(u+u 1) 
I . u!tito es 


6 


If we set u = e”, we have to show that 


ae (ce? +e~9)—it90 
/ e mye +e Ito do #0. 
—cC 
In other words, it suffices to show that 
oe (09 -6 
/ e TV’ +e") cos 190 do £0. 
0 
This integral is of the form 


oe 6 -0 
i e +4) cos@ de, a>. 
0 
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We would like to determine its behaviour as y tends to infinity. To do this, we 
can apply Laplace’s saddle point method: if f has two continuous derivatives, with 
f() = f’(0) =0 and f”(0) > 0, and f is increasing in [0, A], then 


A ; a 
I(x) := / el dt~ 
0 2x f"(0) 


as x tends to infinity and provided that [ (xo) exists for some xg. A slightly general- 
ized version of this says that if g is continuous on [0, A], then 


‘ =xf(t) ;_ 7 
[ swe dt ~ g(0) mf" 


Now choose f(t) =a! + a~‘ — 2, g(t) =cost so that 


2 ee ) -—0 2 2 a 
e | e *@ +a" ~2) cos 6 dd ~ e~*2loga, | — 
0 Xx 


from which we see that E(z, (1 + ifo)/2) #0, as required. This gives the desired 
contradiction. It is possible to deduce the non-vanishing of the above integrals di- 
rectly without appealing to Laplace’s saddle point method. With some work, it may 
also be possible to derive a zero-free region for ¢(s). 


6 The Rankin-Selberg L-Function 

The unfolding technique employed in the previous section has wider ramifications. 
It can be used to establish the analytic continuation and functional equation for a 
large class of L-functions which fall under the umbrella of Rankin—Selberg theory. 


Let F : H > C be a J-invariant function which is of rapid decay (that is, 
F(x +iy) = O(y") for all N > 1.) Let 


1 
cry= [ F(x+iy)dx, y>0 
0 


be the constant term of the Fourier expansion. Let 


LF.s)= [ CE yy = 

0 y 

be the Mellin transform of C(F, y). 

Theorem 6.1 Let L*(F,s) =2~°I'(s)€(2s)L(F, 5s). Then, L(F,s) has analytic 


continuation to the whole complex plane, regular everywhere except for a simple 
pole at s = 1 with residue equal to 


3 
-[ F(z) dz. 
UT JI\H 
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The function L* (F, s) is regular for all s £0, 1 and satisfies the functional equation 


L*(F,s)=L*(F,1—s). 


Proof The key idea is to use the decomposition described earlier. We have 


lore) 1 
LiF.s)= f / F(x +iy)y® 7 dx dy. 
0 0 


Decomposing the domain of integration as in the “unfolding” technique, this be- 
comes 


5 aX dy 
4 ) F()y ae 
5 y(\H) y 


This can be rewritten as 


d xd 
ee F(yz)(3(y2))” oe ae F(z)(S(v2))’ “S 


VET o\F VET og\P 


because F is J”-invariant. Moving the summation inside the integral shows that this 
is equal to 


dxd 
/ F()E(z,8) 5. 
I\H y 


As E(z,s) has analytic continuation and functional equation, we get the same for 
L(F,s). 


We now give a few examples on how to apply this theorem. 

In the special case that f is a cusp form of weight k, we may apply the above 
result to F(z) = ia Lf (z)|?, which is easily checked to be J’-invariant. 

A straightforward computation shows that the constant term is 


fore) 
yk > la ea Oe, 
n=1 


The Mellin transform of the constant term is 


2 
i Ya 26 oe 2 = (An) 5-H (5 4k Ds a _ 


This proves the following: 
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Theorem 6.2 Let f be a cusp form of weight k for SLz(Z). If 


f(2) = Sage 


n=1 


is its Fourier expansion at infinity, then the Dirichlet series 
oo 2 

2 lan| 

n> 


has a meromorphic continuation to the whole complex plane. In fact, if 


lan 


w(s) =a 73-10-45 Ps) P(g +k pees) 


n=1 


then w(s) extends to a function which is regular for all s € C except at s = 1 where 
it has a simple pole and residue equal to 


a 
WT JI\H 


Moreover, y(s) satisfies the functional equation w(s) =wU —s). 


wee 


== f). 


If we apply the theorem of Chandrasekharan and Narasimhan [32] mentioned in 
an earlier section, we deduce that 


sla S= =¢F fox* + O(x*- 7”) 


n<x 


because twice the sum of the coefficients in the Gamma factors (or equivalently the 
degree in the sense of Selberg) is equal to 4. By taking a single summand in the sum 
on the left, we deduce that a, = O(n*/?—!/>). The same technique applied to Maass 
forms gives us ad, = O(n7/10), 

If we take f and g to be cusp forms (or even with one of them a cusp form), we 
consider 


y* f (2ge@ 


which is J’-invariant. If 


f= ome 


n=1 


and 


[o,e) 
ga) = Dp bien" 
n=1 
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are the respective Fourier expansions at infinity, then the constant term is easily 
computed to be equal to 


leo) 
k y p»—4any 

y ) anbne ae 
n=1 


One could also take forms of different weights k; and kz and consider 


yi tk)! ¢(z)9(z). 


In the end, applying Theorem 6.1, we deduce that 


admits an analytic continuation to the entire complex plane. If f and g are normal- 
ized Hecke eigenforms, then the series has an Euler product. This is essentially a 
consequence of Ramanujan’s identity 


(4 )(45)) ae 1 — zwq? 
A\z-1 J wat J? © =p 2) = way = 209) 


A suitably normalized version of this series L ¢ ,(s) (with appropriate J”-factors, 
¢(2s) and so forth) extends to a function which is regular everywhere except possi- 
bly at s = 1 where it may have a simple pole with residue equal to 


3 
a 


Thus, if f and g are orthogonal to each other, then the normalized series extends to 
an entire function. 
Kronecker’s limit formula states that 


tim] EC, s)- | = log(e” /47r) — 2log(/¥|n(z)|") 


where n(z) = q!/*4 To, -— 9g") with g = e-'z Tf f and g are Hecke eigen- 
forms with wy, 2g being the associated automorphic representations, the Kro- 


necker limit formula allows us to write down an exact formula for the special 
value L(1, wf @ 7g). 


7 Poincaré Series for SL2(Z) 


The Poincaré series for SL2(Z) are defined by 


1 ath 
Gr@=5 Dy (eta) tell care 
(c,d)=1 
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where the summation is over all integers c,d with (c,d) = 1, and a, b are any inte- 
gers such that ad — bc = 1. Observe that if r = 0, this reduces to the classical Eisen- 
stein series E;(z) (up to a constant). Thus, the Poincaré series are to be viewed as 
generalizations of Eisenstein series. It is easy to see that the inner summand does not 
depend on the choice of a solution. Indeed, by the Euclidean algorithm, any other 
solution for (a, b) has the form (a + tc,b + td), and 


ieee ENOE TE) ee ee, 
cz +d cz+d 


so that 


«( aEtb : b 
e2nir( Bath) = oni Ga). 


We can rewrite the series in a more invariant form by setting 


: b 


and then 


Ga Fay eee. 
yela\l 


The important thing to note is that G,(z) is a modular form of weight k. To see this, 
let 5 € F = SL2(Z). Then, 


Grbz= DY) fly, 82) Ferro). 
VET oo\P 


Now, we have the so-called cocycle relation 
J (V8, 2) = J (7, 62) 7 (6, 2) 
as is easily verified, so that 


J(vS, z) 
J (6, 2) 


i(y,6z)= 
and 


Gr(8z) = f6,2)* SY) flyd,z) hemo 
VET o\l 


= j(6,2)G,(2). 
Holomorphy is easy to verify using standard tests of complex analysis. In ad- 


dition, G;(ico) = 0 if r > 1. We conclude that for every r > 1, G,(z) is a cusp 
form of weight k. Thus, Poincaré series give explicit constructions of cusp forms. 
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For a detailed treatment of this theory, we refer the reader to Rankin’s book [171] 
(especially Chap. 5). 

Now let f be any cusp form of weight k. We would like to compute the inner 
product (f, G;). 

First observe that e27!% = e27!* . e~ 2) so that 


e2niz =e 2mix e 2ny at 2mi(Z) 


Thus, 


(f, Gr) = i > fee 2 ez Fay*yt dx a 


A vyery\P 


esd A gd 
= a YY ezt+ at f@errr 7 _ SY 


\H Pr lez+d|**  y 


— Dy i: flyaxyake 2) SO 
jer y 


= ‘S i f (23(z)ke7 272 dx - 
y\A) 


yVEloo\P 
1 
= ea f(x a Wwe te dx dy : 
0 JO y? 
Now from the Fourier expansion of f(z) we have 


[ee 


f (2) = ae . en lmny. 


n=1 


We see that the x-integral picks up the rth Fourier coefficient. Thus, 


CO 
(f, G;) = a, [ gy ye dy. 
0 
By setting 4zrry = ¢ in the integrand and simplifying, we deduce the following: 


Theorem 7.1 Let f be any cusp form of weight k for I’. Then, 


Pies he: 


ie agar 7 en = 


An important corollary is: 


Corollary 7.2 Every cusp form is a finite linear combination of Poincaré series 
G;(z). 
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Proof The set of Poincaré series spans a closed subspace in the space of cusp forms. 
If f is a cusp form not in this space, all of its Fourier coefficients must vanish by 
the previous theorem. Thus, the orthogonal complement is zero. 


As an example, consider the case k = 12. Each of the G,(z) is a cusp form 
of weight 12. But any cusp form of weight 12 must be a constant multiple of A, 
Ramanujan’s cusp form. Thus, 


G;(Z) = cr A(z). 


What is c,? By the above theorem, 


rdil 
(A. Gr) = EOP = ora. 
Hence, 
(4xr)!! —— dxdy 
tr)= 5 [.,°soG 


which is an interesting formula for the Ramanujan t-function as an inner product. 


8 Fourier Coefficients and Kloosterman Sums 


Emanating largely from the work of Petersson [154] in the 1930s and Selberg [179], 
an explicit formula can be given for the Fourier coefficients of G;(z). This strik- 
ing formula involves the Kloosterman sums, and their appearance has opened new 
connections to the Selberg eigenvalue conjecture as well as applications to classical 
questions of analytic number theory. We now derive this remarkable formula. 

We begin by writing 


[o,e) 
Gr) = Dogme". 


n=1 


Then, 


1 
em= Gane, 
0 


More precisely, for reasons of convergence, we should consider 


1+i6 y 
y G (z)e 27 ine 
id 


with 5 > 0, but we leave this technical modification to the reader. We have 


1 : (ax tb 5 
&rn = 2 3 / (cx + dk et Sgg)— 2m inx dx. 
(c,d)=1" 9 
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Put cx + d =f. The argument in the exponential becomes 


d t 
“(<u-a +0) ve iat t+ar on r 
tile Cc 


Cc Cc tc 
since ad — bc = 1. Thus, 


1 1 22! (nd-+ar) se —k ,— 2 (L4nt) 
Sin = 5 > - \ ec ae mea dt 
Cc. 


c4#0 d(mod c) = 
# ad=1(mod c) 


because 


: k 221 (nd+ar—nt) ,— 22" 
2 t “ee : e ¢ dt 
0 


(c,d)=1 


depends only on d (mod c). Writing d as dy + (m+ 1)c with varying m, we transform 
the integral from 0 to | into an integral from —oo to oo. This integral turns out to 
be a Bessel function: 


oot+ci oni 
/ g exp(-= (¢ + mt) Jar = 2n(n/r)*—-Y/? 1 (42/rn/c) 
—ootci c \t 


where 
no= = | i ee ge 
Qi C 


where C is the unit circle. The sum 


2ni 
S(r,n,c):= Se go rar) 


d(mod c) 
ad=1(mod c) 


is called a Kloosterman sum. Using this notation, we obtain the beautiful formula 
due to Petersson: 


00 
= S(r,n, Cc) An ./rn 
§rn = (n/r)® mie brn +10 ) sO ha ) 
ne Cc Cc 


where 6,, denotes the Kronecker delta function. 
We have already noted that the Poincaré series span the space of cusp forms. 
Thus, to prove the Ramanujan conjecture, it suffices to show that 
k=1 
8rn = O(n 2 she 
for every r. This is tantamount to showing that the expression in parentheses in the 


above sum is O(n‘). 
Selberg, using this expression and Weil’s estimate for Kloosterman sums: 


|Si7,n,0)| <d(c)e!? 7,0, 0)'” 
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as well as the bound 
Je-1(x) < Amin(x‘"!, se) 
obtained that 
rn = O(nk/2-14te), 


Note that this is better than what we obtained earlier by the Rankin—Selberg method. 
Since the estimate was obtained crudely, Selberg felt that there must be cancellation 
among the Kloosterman sums. This led him to formulate the following conjecture 
(which was also arrived at independently by Linnik). 


Conjecture (Selberg—Linnik) 


Ga):=)> ane = 0(x*) 


CHx 
for x = gcd(r, n)!/?*€ for any € > 0. 


In his 1965 paper, Selberg stated that this would lead to a proof of the Ramanujan 
conjecture (for Maass forms as well) but did not indicate a proof. We will indicate 
below how such a proof can be obtained for the full modular group. The argument 
is adapted from a paper of R. Murty [129]. 

Let us first observe that Weil’s estimate for Kloosterman sums leads to the esti- 
mate 


G(x) = O(x!/? logx) 


for (r,n) = 1. Kuznetsov [102] proved that G(x) = O(x!/6+€), but the O-constant 
depends on r,n, and so it is not applicable to the estimation of the Fourier coeffi- 
cients. Let 


H(x) := Y> S(r.n,c). 


C<x 


By partial summation, the Selberg—Linnik conjecture is equivalent to 
Ho) SOW); 


We begin by considering 


S(r,n,c) (= *) 
i, (ee 
c c 


c>J/n 
a Gif (eg el 2) 
c>J/n . 
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by partial summation. By the mean value theorem, the expression in parentheses is 


4n./rn 
AEE Ix Ec) 

for some & € (47./rn/(c + 1), 41./rn/c). Using the estimate 
peo ce a 


we get 


Sn, c) | 4n/rn Helos, 
> Z Je (= )< nl » aie 
>/rn c>J/n 


by the Selberg—Linnik conjecture. Thus, we need only consider 
S(r,n, Cc) 4r./rn 
yea 
Cc Cc 
cx. Jn 


To estimate this, we apply an inductive argument. As there are no cusp forms of 


weight 10, we have 
y S(r,n, o) y »( =") - O(n). 
c 
c<J/n 


So, if for example, we were trying to establish the conjecture for k = 12, then it 
suffices to estimate for k = 10 the quantity 


S(r,n, Cc) 4n/rn 4 ./rn 
~ ——— J Je Jx-1{ —— ] ff. 
c c 
c<J/n 
By the familiar identity 


2k Jeo) — Trai (x) + Je—-1 (x) 


it suffices to estimate 


4n./rn 
a S(r,n, c) Jk 
= 
c<J/n 
Again, by partial summation, we may write this as 


Jn Enola) af] 


cx. Jn 
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Again, the expression in the brackets is 


An /rn 
c(c+ 1) 


I Ec): 


Using the estimate 


LO<a 


as before and the fact that H(c) = O(c!t*), we deduce a final estimate of O(n‘) 
as desired. This completes the proof of the fact that the Selberg—Linnik conjecture 
implies the Ramanujan conjecture (for the full modular group). A similar argument 
can be applied to higher levels. However, the non-existence of cusp forms of small 
weight is not guaranteed. In this case, we exploit the fact that we know the Ramanu- 
jan conjecture in the weight two case (a result due to Eichler and Shimura). 


9 The Kloosterman-Selberg Zeta Function 


In order to gain more insight into the Selberg—Linnik conjecture, we will consider 
(with Selberg [180]) the series 


Z(r,n,s)= > BU): 


Ic | 2s 
c#0 


To study this series, Selberg [178] considers the cognate Poincaré series 


Un(z,5) = Ss Gd pe 
Doo\P° 


Clearly, U;, is [’-invariant. Moreover, if 


then 
AU, (z, 8s) =s(1— s)U,(z, 8s) t4anUy(z, 5 + Is. 


One can show that the Fourier expansion of U,,(z, s) contains Z(r, n, s) in it. This al- 
lows us to relate the eigenvalues of A with the abscissa of convergence of Z(r,n, 5). 
More precisely, 


CO 


Un(z,s)= )) Br(m,y, se" 


m=—-C 


where 
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S —2mny 


Bn(m, y,8) = se e 


ae S(m,n,c) ys 
 ef2s 


r ( er 2nn ) dv 
x ex —ZITIMYU i 
oon By i) (14) 


It then turns out that 


(2a Jam) yee 2 
c=1 
__ sinzs = aj(n)aj(m) a 5¥ =, me ae ; 
a dX Soha: r(s stir)r(s 5 ir) 
nm I'(s) 1. f® i h(r, s) 
2x FU—s) =f tajm O2ir (m)o—2ir (Mm) ye r 


where 


sins 1, 1, 
h(r,s) = 5 eS te oe leas 


and the a;(n)s are the Pounce coefficients of the Maass form corresponding to the 
eigenvalue A; = 1/4+ r?. This remarkable formula establishes a striking relation- 
ship Hetween: the eigenvalues A; and the Kloosterman—Selberg zeta function. 


10 Rankin-Selberg L-Functions for GL, 


Let GL (R) be the group of 2 x 2 matrices with real entries and positive determi- 
nant. The upper half-plane can be realized as a symmetric space of GL} (R). More 
precisely, if Z denotes the subgroup of scalar matrices and SO(2) denotes the spe- 
cial orthogonal group, then the upper half-plane can be realized as the coset space 
GL} (IR)/ZSO(2). Modular forms then can be viewed as functions on GL, (R) that 
transform appropriately under certain subgroups. Once we view the classical theory 
from this perspective, it affords a vast generalization. We refer the reader to [36] for 
details. 

For the moment, it will suffice to say that classical Hecke eigenforms correspond 
to certain representations of GL2(Aqg) where Ag denotes the adele ring of the ratio- 
nal number field. More generally, one can consider cuspidal automorphic represen- 
tations of GL, (Ag). Such representations 2 can be decomposed as a restricted ten- 
sor product oo @ (@’tp), and one can associate parameters [11,00,.-- 5 [n,oo tO Moo, 
and a1, p,..., Qn,» called Satake parameters (for all but finitely many primes p). The 
Ramanujan conjecture for GL, is the prediction that |q@;,)| = 1 for all primes p and 
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all | <i <n. The Selberg eigenvalue conjecture translates into the assertion that 
RU j,co) =O forl<j<n. 

Many of the methods we discussed in the earlier sections can be developed to 
deal with the higher rank case. We refer the reader to [36] for further details. 

With reference to the Rankin—Selberg method, the general theory for GL, was 
initiated and developed by Jacquet, Piatetski-Shapiro and Shalika [87], Shahidi 
[187] and finally completed by Moeglin and Waldspurger [125]. If 2; and zy are 
cuspidal automorphic representations of GL, and GL, of the adele ring over the 
rationals (say), then the Rankin—Selberg L-function is defined by the Euler product 


L(s, 7 @ m2) = I] L(S, %1,p @ %2,p) 
P 
where for all but finitely many primes p, the Euler factors are given by the formula 


(1) 2) 


Ppt ie oc 
Lis, T1,p @ 12, p) = Hl: _ 2 


ij 


and 


for r = 1, 2. Itis possible to define the Euler factors at all the places so that the final 
product converges for i(s) > 1. The completed L-function turns out to be entire 
unless 


1) =~ 1" @ | det |! 


for some real number f, in which case the function is regular everywhere except at 
Ss = 1 —it where it has a simple pole. 

One can, of course, consider L-functions of higher tensor powers. These L- 
functions contain finer information about lower rank L-functions, and it would not 
be an exaggeration to say that the analytic theory is still in formation. 


Chapter 5 
The Circle Method 


1 Introduction 


In his early letters to G.H. Hardy, Srinivasa Ramanujan wrote that “the coefficient 
of x” in 


(Ope) 


sinh ./n 
m/n 

I have not written here the forms of F and f as they are very irregular and compli- 

cated and their values are very small. Hence, the coefficient is an integer very near 


to 
1 sinh ./n 
—|coshx/n— ——~ ), 
i (e ay ) 


and not always the nearest integer, as I hastily wrote to you before.” The significance 
of this passage is that Ramanujan was writing about the explicit determination of the 
Fourier coefficient of the reciprocal of a modular form of weight 1/2. We see from 
this excerpt that he already had some intuition of the circle method even while he 
was in India and before he met Hardy. Indeed, in his letter, Ramanujan calculated 
the nth coefficient of the reciprocal of a theta series which is of weight 1/2. In 
the case of the partition function, one needs to calculate the nth coefficient of the 
reciprocal of the Dedekind 7-function, which is also a modular form of weight 1/2. 
The circle method would become crystallized in their epoch-making paper [71] on 
the partition function written in 1918. We will make further remarks later on this 
point. 

The idea of the method can be explained very easily. Suppose that we have a 
set A of numbers and we are interested in counting the number r;,4(”) of ways a 
natural number n can be written as a sum of k elements in A. It is natural to form 


(cosh vi - ) + F(cosaJ/n) + f (sinaJ/n). 
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the generating function 


Fa(x) = ae 


acA 


and recognize that the coefficient of x” in the power series expansion of F'4(x)* is 
precisely rg,4(). However, we cannot proceed further unless we have some addi- 
tional information on the function F'4(x). This is often difficult and in many cases 
it is unreasonable to expect any further information. If, however, we look at the 
trigonometric series 


fa(0) = ~ e2miae 


acA,a<n 


then, by elementary Fourier series, we have 


1 
nan) = | fake 27"? ao. 
0 


Again, we cannot proceed unless we know something about /4 (6). But in this case, 
we exploit the continuity of f4 and realize that if we know the value of f,4(a/q) 
for a rational number a/q lying in the unit interval, then we know (by partial sum- 
mation) the behaviour of f(@) in a small neighborhood of a/q. The strategy then 
is to break up the unit interval (the circle) into “small” intervals around rational 
numbers (called major arcs) where the behaviour of f4(0) is “known” and a com- 
plementary set of intervals (called minor arcs) and expect that the contribution from 
the major arcs dominates. In other words, one uses an alternate method to deal with 
the minor arcs to deduce that the contribution is small in comparison with the con- 
tribution from major arcs. This often is the difficult part in the application of the 
method. However, the method is sufficiently powerful that it enables one to make 
very precise conjectures on the expected behaviour of rx,,4(m) in many interesting 
cases. Indeed, the behaviour of f(a/q) relies on our knowledge of the distribution 
of the elements of A in residue classes modulo q. For example, in the case that A is 
the set of prime numbers, rx, 4 (7) is the number of ways of writing n as a sum of k 
prime numbers. Then, we have 


q-1 
fa(a/q) = Sereria — yor iablaa(n, qs b), 


psn b=0 


where z(n,q,b) is the number of primes p = b (mod q) with p <n. From the 
theory of classical Dirichlet L-series, one has a great deal of information about 
z(n,q,b), and this information can be injected into the formula above to derive 
formulas for the number of ways of writing n as a sum of k primes. For instance, in 
the case of the binary Goldbach problem, the major arc calculation predicts that the 
number of ways an even number n can be written as a sum of two prime numbers is 
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asymptotic to 


—1 
Sines p- log*n 


where C is the “twin prime constant” given by 


I('- a=) 


p23 


and both products are over prime numbers p. 

Hardy and Ramanujan developed this strategy to derive an asymptotic formula 
for the partition function in their epic paper of 1918. After Ramanujan’s sudden 
demise, Hardy and Littlewood developed the method further and applied it to two 
classical problems, namely, Waring’s problem and the Goldbach problem. In this 
chapter, we will discuss briefly the Hardy—Ramanujan work on the partition function 
which laid the foundations of the circle method, and then outline how the method 
can be applied to Waring’s problem and the Goldbach problem. In this direction, 
we take an approach due to Linnik and Hua in our treatment of Waring’s problem, 
where the circle method is needed only at the end. In the case of the Goldbach 
problem, we essentially follow Hardy and Littlewood in their analysis of the ternary 
Goldbach problem where they assumed the generalized Riemann hypothesis to get 
the necessary estimates on the minor arcs. 


2 The Partition Function 


The function p(n) is the number of ways of partitioning n. For example, the number 
4 can be partitioned as 1+ 1+1+1,orl1+1+2, or2+ 2, or 1 +3, or simply 4 
so that p(4) = 5. The reader should note that we do not count the orderings. Thus, 
the partition | + 1+ 2 is the same as 2+ 1+ 1. From the definition it is easy to see 
that each partition of 7 can be “factored” as 


nak +2k. +3k3+--+nkn, 


where k; counts the number of times 7 occurs in the partition. From this observation, 
we deduce that p(n) has the generating function given by 


> p(n)x” = [[a — ad ae 
n=0 k=1 


Why is p(n) interesting? Firstly, the function occurs naturally in many places. 
For instance, if S$, is the symmetric group on n letters, then the conjugacy classes are 
in one-to-one correspondence with the partitions of n. This observation also leads 
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to an explicit description of the irreducible representations of S,. Surprisingly, par- 
titions play a fundamental role in the representation theory of general linear groups. 
They also have a dominant role in combinatorics and theoretical physics. 

In 1918, Hardy and Ramanujan discovered, using the circle method, the asymp- 
totic formula 


exp(z /2n/3) 
AnJ/3 


as n tends to infinity. In 1937, Rademacher [157] derived the following asymptotic 
expansion: 


p(n) ~ 


sinh(4 F(n — 
p= 5 VEO =| hd zi 
where 
Ax(n) = SS erilscn,k)—2nm/K]. 
Osm<k;(m,k)=1 
and 


gminb 


Cc 


1 oan 
S(b,c) = ac Y “cot C cot 
n=1 

It seems that Selberg [181] had also arrived at this formula but never published it. 
He wrote, “In the summer of 1937, I had actually myself been studying number 36 
in the Collected Papers of Ramanujan and had arrived at [Rademacher’s] formula 
.. It always seemed strange to me that Hardy and Ramanujan did not end up with 
this formula ... and I believe firmly that the responsibility for this rests with Hardy.” 
Recently, Bruinier and Ono [28] discovered a new “exact” formula for p(n). This 
formula expresses p(n) as a finite sum of special values of a “weak” Maass form 

evaluated at CM points in the upper half-plane that have discriminant —24n + 1. 

To be precise, let 


Ex(q)=1—24)\a(n)q", o(n)= od, quent 


n=1 d|n 


be the “quasimodular’” Eisenstein series of weight two. Let n(z) be the classical 
Dedekind eta function of weight 1/2. Define 


Eo(z) — 2E2(2z) — 3E2(3z) + 6E2(6z) » 1 
2n(z)2n(2z)2(3z)2n(6z)2 = 


One can show that f(z) is a weight —2 meromorphic modular form on J(6). Now 
let 


a= 10 — 29q — 104q?---. 


Pte ae ) 
@i= (= sta f@. 
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This is a weak Maass form of weight zero. It is an eigenfunction of the hyperbolic 
Laplacian 


with eigenvalue —2. Now consider all positive definite integral binary quadratic 
forms Q(x, y) =ax* + bxy + cy? with discriminant b? — 4ac = —24n + 1 and 6Ja. 
The group J9(6) acts on such forms in the obvious way, and we let Q,, be the set of 
these representatives with a > 0, b= 1 (mod 12). Let 


_ —b+ V1 =n 


“Oo 2a 


Then, the theorem of Bruinier and Ono [29] is that 


1 
eee eC) 


€Qn 


p(n)= 


This formula was announced in early 2011, and upon seeing this, a natural idea 
that arises is to see if one can derive the Hardy-Ramanujan asymptotic for p(n) 
from this algebraic formula. This indeed can be done, and the details will appear in 
a forthcoming paper of M. Dewar and M.R. Murty [42]. 

As noted by Hardy and Ramanujan, p() satisfies the simple recursion formula 


np(n)= oh S* p(n—hk). (26) 


h=1 k<n/h 


To see this, we simply list all the partitions of m and count up all the numbers in this 
listing in two ways. The sum of all the numbers is clearly np(n) which represents the 
left-hand side of (26). On the other hand, for each natural number h with 1 <h <n, 
we can count how often A occurs in the listing. Indeed, the number of partitions 
having at least one h is clearly p(n — h). Those having at least two hs is p(n — 2h). 
Thus, the number having exactly one h is 


p(n —h) — p(n — 2h). 


Thus, 


n 


np(n) = So fa(p(n —h) — p(n—2h)) + 2h(p(n — 2h) — p(n — 3h)) +--+} 


h=1 
n 

= doh DE pa —hk), 
h=1 k<n/h 


as claimed. 
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This simple recurrence can be used to derive a weaker result than the Hardy— 
Ramanujan theorem. In 1942, Erdés [48] indicated how the recursion can actually 
be used to derive the Hardy—Ramanujan asymptotic formula for p(n). However, 
he was not able to evaluate the constant 1/4./3. This was done later by Newman 
[143] in 1951. Though the proof is elementary, it is by no means simple. We refer 
the reader to the papers by Erdés and Newman for further details. Here, following 
Hardy and Ramanujan, we show that the recursion and simple induction can be used 
to prove that 


log p(n) ~ 1V/2n/3. (27) 


We proceed in two stages. First, we show that 
p(n) <e",  a=n,/2/3. 
Second, we show that for every € > 0, there is a constant C, such that 


1 
p(n) > a 


€ 


These two inequalities establish (27). 
For the first step, we clearly have p(1) = 1 < e*, and we may suppose that 
p(m) < e“v™ has been established for m <n. For the recurrence (26), we have 


n nfh 


np(n) < » > he, 


h=1k=1 
We use the elementary inequality 
(l—x)" <1—ux, 
valid for any 0 < x < 1 and u > 0, to deduce 


n njh 


n oo 
np(n) < > ee < evn Se yo ie e, 


h=1k=1 k=1h=1 


Using the familiar inequality sinh x > x and 


CO 
—MxX 1 1 
dome ™ = 7 <a) 
4sinh*5 xX 


m=1 


we deduce 


from which the estimate follows. The lower bound is derived similarly. 
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The central observation in the Hardy-Ramanujan paper is that the generating 
function 
[o,@) [o,@) 
Y > pn)x" = [[@ — ro 
n=0 n=1 


becomes, upon setting x = e?7!T 


modular form of weight 1/2. 
More precisely, if 


, related to the Dedekind n-function, which is a 


CO 


F(x)=]][Q-2")", 


n=1 


setting x = e?7'T, we see that F(x) = e?7!*/!? /n(t) where 
CO 
n(t)=q'/™* T [(1-q"), q=eentr, 
n=1 


is the Dedekind n-function. Now, 7 is a modular form of weight 1/2, and in partic- 
ular, 


n(—1/t) = Vt/in(t), 


so that this translates as 


Fe") = alee OEE ee): 


1 
viny= vee ( (2-1), 


then as t tends to zero, e~2* tends to 1 and e~2/* tends to zero, so that F (e 
tends to 1. Thus, for x sufficiently close to 1, F(e~2"*) is very close to w(t). It is 
therefore reasonable to expect that for t close to zero, F(e~2") is approximated 
by y(t). By Cauchy’s theorem, 


a 1 i FQ) | 
n)=>=— x 
i 20 i cxntl : 


where C is a circle around x = 0 of radius < 1. Equivalently, changing variables, 
we have 


In other words, if 


—2n/T) 


1/2 F 
p(n) I Re ee dt, 
—1/2 


so that if we insert our approximation for F(e~*"*), we get 


1/2 ; 
p(n) ~ w(rer'" dr, 
1/2 
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This integral is not too difficult to evaluate and is of “Bessel type.” One finds that 


1 a Jn 2 

in its , a=nV/2/3. 
Of course, the approximation of F (e?”*) by w(r) has to be quantified. This is pre- 
cisely what Hardy and Ramanujan do in their epic paper. The modular transforma- 
tion property allows one to derive the asymptotic formula for p(7). This is the basic 
strategy. The details of the paper are not simple. But the idea is. 

Shortly after Ramanujan fell ill and returned to India, Hardy and Littlewood rec- 
ognized the potential of the circle method in treating other problems in number the- 
ory. In a series of remarkable papers, they addressed Waring’s problem and Gold- 
bach’s conjecture. We discuss these in the next two sections as they illustrate the 
versatility of the method. 


p(n) ~ 


3 Waring’s Problem 


In 1770, Edmund Waring, in his book “Meditationes Algebraicae” made the pro- 
found statement (without proof) that every natural number can be written as a sum 
of at most four squares, nine cubes, nineteen fourth powers and so on. More pre- 
cisely, this observation, called Waring’s problem, is stated as follows: 


Conjecture (Waring’s problem) For each k > 2, there exists a positive integer g = 
g(k) > 2 such that every natural number n can be written as a sum of g(k) kth 
powers. That is, for every n = 1, there exist non-negative integers x|,X2,...,Xg 
such that 


naxttxyt-. 4x8. 


We note here that g(k) is chosen to be the minimal number with the above prop- 
erty. That is, one can find at least one n = | which cannot be written as a sum of 
(g(k) — 1) kth powers. 


In 1909, Hilbert [80] proved that Waring’s observation is true. However, his proof 
only showed that g(k) exists but gave no method to effectively find it or bound it. 
Thus, we are led to the following questions. 


(1) Can we find a precise formula for g(k) for all k? 
(2) Can we find a formula for g(k) that works for sufficiently large values of k? 
(3) Can we determine the asymptotics, lower bounds or upper bounds for g(k)? 


In 1772, J.A. Euler, the son of L. Euler, conjectured [52] that 


k 
g(k) =2* + BE = 
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In 1936, Pillai [156] showed that Euler’s conjecture is true, provided that 


AQ HQ) 


By the work of Mahler [119] in 1956, it is known that there can be at most finitely 
many values of k for which the above inequality does not hold. It is conjectured that 
the above inequality holds for all k. 

By the celebrated theorem of Lagrange, g(2) = 4. In 1909, Wieferich [202] 
showed g(3) = 9. A gap in his proof was filled in later by Kempner [92] in 1919. 
The fact that g(4) = 19 is a recent theorem of Balasubramanian, Deshouillers and 
Dress [14, 15]. Earlier, Chen showed in 1964 that g(5) = 37, and Pillai [155] in 
1940 proved that g(6) = 73. There are still many open problems in the theory. For 
instance, it is conjectured that every sufficiently large number can be written as the 
sum of four cubes. 

It is possible to prove the existence of g(k) by using elementary methods. This 
was first done by Linnik in 1943 using fundamental ideas of Schnirelmann. In this 
chapter, we outline Linnik’s elementary solution of Waring’s problem combined 
with a theorem of Hua on exponential sums. Then, we indicate how to use the circle 
method to derive an asymptotic formula for the function rg ;(n), the number of ways 
of writing n as a sum of g kth powers. In this way, the circle method comes in only 
at the end to enable us to derive the asymptotic formula for the number of such 
representations. 


3.1 Schnirelmann Density 


Let A CN, and for every n > 1, let A(n) = #{a € A: a < n}. We define the 
Schnirelmann density of A, denoted 6(A) to be 


8(A) := infy> mg 
n 


We observe that A(n) > 6(A)n for all n > 1. We also observe that 0 < 6(A) < 1 and 
6(A) = 1 if and only if A=N. 

The Schnirelmann density is different from the asymptotic density a (A) defined 
as 

A 
o(A)= lim a 
n>o n 

While o (A) measures the asymptotic behaviour of Aw) for arbitrarily large values 
of n, the Schnirelmann density 5(A) is affected by the first few values of n. For 
example, if E and O denote the set of even and odd natural numbers respectively, 
then 6(E) = 0 and 6(O) = 1/2. On the other hand, o(E) = 0 (O) = 1/2. 
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Given two sets A and B of integers, let A-+ B denote the sumset of A and B, that 
is, A+ B={a+b:aeA,be B}. If A and B are subsets of N, let A ® B denote 
the sumset of A U {0} and B U {0}. That is, 


A®B:=AU{0}+ BU {0}. 
The following remarkable theorem was proved by Schnirelmann in 1936: 


Theorem 3.1 (Schnirelmann, 1936) For any two subsets A and B of N, 


5(A @ B) = 5(A) + 8(B) — 8(A)3(B). 


Proof Suppose that A(n) =r and the elements a; <n of A are ordered as | < aj < 
az < a3 <::: <a, <n. Let 


B,:={be B:b <ayj}, 


By:={be B:aj-1+b<aj} if2<i<r 
and 
Bra i={be Bia- +b <n}. 


Observe that the sets {a),a2,...,a-} and A+ B; for | <i <r +1 are disjoint sub- 
sets of (A @ B), and each element in these sets is < n. For notational convenience, 
let us define B(O) = 0. We have 


r+l 
(A@ B)(n) = A(n) + 4B; 
i=l 
> A(n) + Bla — 1) + 2 B(@j — aj-1 — 1) + Btn —a,). 
t=? 


Combining the above inequality with the property that B(n) > 5(B)n, we get that 
for every n > 1, 


(A ® B)(n) > A(n) + 6(B)} (a — 1) + )OG@ — G1 - I) + (a— eo] 
i=2 


= A(n) + 6(B)(n —r) 

= A(n) + 6(B)(n — A(n)) 

= A(n)(1 — 5(B)) + 6(B)n 

> 6(A)n(1 — 5(B)) + 5(B)n 

= n(8(A) + 6(B) — 6(A)5(B)). 


This proves the theorem. 
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A more general version of Schnirelmann’s theorem can be stated as follows: 


Theorem 3.2 For Aj, A2,..., A; CN, 


t 


(@ a) >1-]](1-6(Aj)). 


i=1 


Proof We have already proved this theorem for tf = 2. A simple inductive exercise 
proves it for all t > 2. 


For any m € N and A CN, we denote 
m 
mA:=QA. 
i=l 


Lemma 3.3 /f 5(B) > 1/2 for some B CN, then 5(2B) = 1. In other words, 
2B=N. 


Proof Let n € N. We will show that n € 2B. If n € B, then we are done. If n ¢ B, 
let B, ={b € B:b <n} and B) ={n—b: be B,}. Clearly, #B, = #B) = B(n), 
and B, U BY C {1,2,...,n}. Thus, #(B, U B)) <n. 

Since 6(B) > 1/2, we get 

n 
#B, = B(n) =#B), > 5" 

Let us assume that B, and BY, are disjoint. This implies n < #(B, U B)), which 
contradicts the fact that #(B, U B/,) <n. Hence, our assumption is false, and B, and 
By, are not disjoint. In other words, there exist bj, by <n € B such that b} =n — bo, 
that is, bj + by =n. This proves that n € 2B. 

This proves the lemma. 


The following theorem of Schnirelmann connects the concept of Schnirelmann 
density with Waring’s problem: 


Theorem 3.4 /f A is a subset of N such that (A) > 0, then there exists m € N such 
that d(mA) = 1. 


Proof If 5(A) = 1, we are done. If not, we have 0 < 6(A) < 1. Since 0 < 1—6(A) < 
1, we may choose ft large enough so that 


1 
1-8(A))' < x. 
(1-3(A))' <5 
By Theorem 3.2, we see that 


8(tA) > 1—(1—4(A))' > - 


Theorem 3.4 follows as a quick application of Lemma 3.3. 
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In 1942, Mann [120] proved that Schnirelmann’s theorem can be refined to show 
that 6(A @ B) > min(6(A) + 6(B), 1). 


3.2 Schnirelmann Density and Waring’s Problem 


Schnirelmann introduced these ideas to attack Goldbach’s conjecture and indeed, 
using elementary sieve methods, showed that there is a constant co such that every 
number can be written as a sum of at most co primes. Many mathematicians have 
successively improved upon this constant cg, and at the moment, cp = 7, a result 
due to Ramaré [168]. By the same methods, one can show that every even number 
is the sum of at most six prime numbers. It is remarkable how far such an elementary 
method can be pushed. 

In 1943, Linnik applied Schnirelmann’s method to attack Waring’s problem. We 
discuss this now. 

Let k > 2 and Ax = ix" x € N}. We observe that 


1/k 
1 2 Ax(n) Z n 


n n 


for every n > 1. 


Now, 


1/k 


5(Ay) <0 (Ay) < lim “— =0. 
n—>oo n 


Thus, 5(Az) = 0. In view of Theorem 3.4 of Schnirelmann, if we could show that 
d(mAx) > 0 for some m > 2, then for some g = g(k) > 1, Bi, Ag =N. This 
would solve Waring’s problem. 

For positive integers g and m, let rg ,(m) denote the number of non-negative in- 
tegral solutions of the equation ar + x feeet x = m. Observe that if this equation 


holds, then 0 < x; < m!/ for each 1 <i < g. Thus, 
re gm) < (m'/* + 1)8 <, m8!" 


The following theorem of Linnik, which we will prove in the next section, shows 
that we can find some g for which the above estimate for rg ,(m) can be sharpened. 


Theorem 3.5 For any k €N, there exist g € N and a constant c(k) depending only 
on k such that for allm > 1, 


re.e(m) < clk)mi—!, 


With the help of Linnik’s theorem, we prove the following theorem: 


Theorem 3.6 For any k €N, there exists g € N such that 6(g Ax) > 0. 
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Proof By Theorem 3.5, there exists g € N such that 


n n n 


S>oree@n)<s SS ckymth<c(kynt-! S* 1.28) 
m=0 m=0 m=0 
rg .x(m)40 rg,k(m)#O rg k(m)#O 


We also observe that for each | <i < g, if 


ni/k 
O<x< gilk? 
then, 
ae <n 
i=l 
Thus, 


Yo ois YO reu(m). (29) 


X1 XQ, Xe m=0 
osyenl rxtx 
Observe that 
nilk ‘ eS nilky 8 
» gi/k = = 1/k 
X1,XQ, +, Xg 
i/k 
OSX STE 


and 


n 
S> 1 =(gAg)(n). 
m=0 
rg x(m)#0 


Combining these observations with the inequalities in (28) and (29), we get 


nil/k\ & bj 
(<7) <c(k)nk“(gAx)(n) for every n> 1. 


This implies that there exists a positive constant C(k) > 0 such that for every n > 1, 


(gAx)(n) 2 C(k)n. 


Hence, 5(gAx) > 0. 


Thus, Linnik’s theorem solves Waring’s problem. In the next section, we prove 
Linnik’s theorem. 
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3.3 Proof of Linnik’s Theorem 


We start this section with a basic lemma on linear equations. 


Lemma 3.7 For a positive integer n, let q(n) denote the number of integer solutions 
(x1, x2, ¥1, Y2) of the equation 


x1yi+x2y2=N (30) 
such that |x;| < X and |y;| < Y. Then 
q(0) « (xY? 


and 


q(n) « Gas *) forn>1. 


d\n 


Proof We first consider the case where n = 0. Clearly, x1, x2 and y; can take at 
most 2X + 1, 2X + 1 and 2Y + 1 values respectively. Once these are chosen, y2 can 
take at most one value. Thus, 


q(O) < (2X +1)°(2¥ +1) « X’Y. 


Similarly, q(0) « XY?. Thus, 


q(0) « min{ X?Y, XY?} « VX2¥.xX¥? « (xY)*”, 


We can do better when n 4 0. We assume, without loss of generality, that X < Y. 
Let gi(n) be the number of integer solutions to x; yj + x2y2 =n such that |x2| < 
|xi| < X and |y;| < Y. This ensures that x; 4 0. Otherwise, we would get x2 = 0, 
which implies that n = 0. 

Let us start by fixing x; and x2 and assume that (x1, x2) = 1. Given a particular 
solution (y;, y5), all solutions of Eq. (30) are of the form 


yi=y, +tx2, y2 = ys — tx, teZ. 


We observe that 


/ —= 
ly5 y2| < ily 
|x1| x1 | 


[t|= 


We conclude 


ams YY (2— +1) <sy ee 


eek * ieee 
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Thus, Eq. (30) has < XY integer solutions. 
If (x1, x2) =d > 1, Eq. (30) has an integer solution, provided that d|n. In this 
case, we take 


From above, the number of integer solutions to the equation 


f £- n 
XV +%Xy2 = a 


is < XY/d. Finally, we conclude that 


1 
q(n)< XY) > . 
d\n 


From the above lemma we deduce the following: 


Lemma 3.8 Let f(x) be a polynomial of degree 2 with integer coefficients, say, 
f(x) =c2x? +.c1x + c9 with cz = O(1), c) = O(P) and cy = O(P?). The number 
of solutions in the variables x; and y; such that 0 < xj, yj < P for 1 <i <4 to the 
equation 


f(x) + f (x2) + £3) + fa) = FO) + £02) + £03) + Fa) (31) 
is << P®. 
Proof We observe that 
fi) — fi) = @i — yi) [e2Gi + yi) +1]. 


We put z) = x; — yj and wj = c2(% + yi) + ¢1. 
The number of solutions of Eq. (31) is less than or equal to the number of solu- 
tions of the equation 


Z1W1 + Z2W2 = —Z3W3 — 24W4, 


where z; < P and w; « P. By Lemma 3.7, we see that for a fixed n > 0, the 
number q(n) of solutions of zj}w 1 + z2w2 =n is 


<P? ifn=0 
and 


1 
K ae ee ifn>1. 
d\n 


Thus, the number of solutions of the equation 


ZW + 22W2 = —73W3 — Z4W4, 
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where z; < P and w; < P, is 


2 
Yo qny?«P®+ (D3) 


In|xP2 1<n<c P? d\n 
1 
« P° + P* — 
2. ae 
I<n<cP? 41!" 
dy\|n 
is 52s 


« P® + P* — 
Ps dp [d), d2] 


1<dy<cP? 


(d,, d2) 
« P° + P® 
2 (d\dz)* 


1<d, <cP2 
1<dy<cP? 


[o,@) [o,@) 
1 
« P§+P°)* 5° —___ « P® 
3/2 ? 
datdgnt (2192) 


where in the penultimate step, we used the elementary observation that (d, do)? < 
dd. This proves the lemma. 


Let us note that the proof of this lemma shows that the number of integer solu- 
tions of the equation 


nyhy +ngh2 = —n3h3 — nah, 


with |n;| < X and 0 < |h;|<Y <X is <(XY)?. 
As a precursor to Linnik’s theorem, we now prove the following theorem, due to 
Hua [83]. 


Theorem 3.9 Let k > 2, and f(x) be a polynomial of degree k with integer coeffi- 
cients, say, 


Ff (x) = egx* + ocg_ix® +--+ ex +00 


such that 
ck = O(1), crk-1 = OP), ..., C1 =O(P*"), co = O(P*). 


Then 


da =O(P® '*), (32) 
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Proof Let us start with k = 2. Observe that 


1| P 8 1 P 4 P 4 
Dy es de -|/ (pewen) a) Ae 
0 0 
x=0 


x=0 x=0 
Thus, the integral in question is equal to the number of solutions to Eq. (31), which 
is < P® by Lemma 3.8. This proves the lemma for k = 2. We now proceed by 
mathematical induction and assume that Eq. (32) holds when we replace k by k — 1. 
Observe that 


P 2 PP 
See _ » 2 ect f i) f2))a 
x=0 x, =0x2=0 
> —2nif (x)a ae 2nif (x+h)a 
h=—x 
/ ! , : 
=P+4+I1+ > > ert Fath) f@))a 
h#0 xX 


where the dash on top of the summations refers to all those integers ) lying between 
—P and P and those integers x such that both x + h and x lie between 0 and P. 
Now, 


k 


k j-l 
fet) fe)= Daath 2) = De (pte. 
j=0 i=0 


Thus, f(x +h) — f(x) =hd(, h), where (x, h) is a polynomial in / of degree at 
most k — 1. Let us define 
ae i e2tihd (xh) 
x 
Then, we have 
2 
/ 
a Pih Wk ¥. ah. 
h#0 


P 
> etif (x)a 
x=0 


Raising both sides by the power 8‘~7, we have 


2.8k-2 gk-2 


’ k-2 
=(P+1+ 5a) gprs 


h#0 


gk-2 
Qnif (x)a ~ is 


h#0 


If 
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then 
2.gk-2 
z ps 


P 


SS, ectif (xa 


x=0 


Hence, 


gk-1 


da < P*8 < pS '-*, 


1| P 
/ Ss etif (x)a 
0 x=0 


since 4.84-2 > k for all k > 2. This proves the theorem, provided that 


| Yn0 an| < P. 
Suppose now that 


Then, 


gk-2 


/ 
Dd, a 


0<|A|<P 


<K 


Applying the Cauchy—Schwarz inequality, we have 
* 93(k—-2)-1 
/ / 
sal =(Eaiy<((E m0) 
h#0 


hA0 
3(k—2)—-14.93(k-2)-2. ’ k-2 3(k—2) _ ’ k-2 
Pp? +2 + sea oa lan |® = p2 > lan|® , 


hé0 hz0 


gk-2 23(k—2)-1 


Since 


a repeated application of the Cauchy—Schwarz inequality leads to an upper bound 
of the form 
gk-2 
Z pet Si ys oe. (33) 
h#0 


ee 


h#0 


This is also immediate from an application of Hélder’s inequality. So, putting it all 
together, 


P 2.8k-2 gk-2 
Da was = | P+1+>~ ay 
x=0 h#£0 
gk-2 
SN al PPO ASE lag 
0<|n|<P hz 
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By raising Eq. (33) to the fourth power, we immediately deduce 


A) F = k-2 : / ay 
/ Seria da « pP*® i) (x lan|® ) da. 
9 | x=0 9 \ no 
We write 
P gk-2 
/ . . 
> e2tihbxhja — ys Awe, 
x=0 n 
where 


n< max |o(x,h)|« pee 
O<x<P 


This gives us 


Tee gk-2 
|A(n)| = if > eotid (aha cm da 
0 x=0 
i| gk? 
</ So ettivtta de. 
0 x=0 
which by induction hypothesis is 
K ps? -k-D_ 
Thus, by Eq. (34) and the above, we get 
i| ? gl 
/ >» e2rifa] ag 
0 x=0 
4 gk-2 1 1 / gk-2 : 
« PX a > Ian! da 
9 \ 40 
« pur) > A(ny)A(n2) A(n3) A (14) 
n1,N2,N3,N4 
hy,ho,h3,h4 
nj<Pk! 
0<|hj|<P 


nyhy+nzhz=—n3h3—ngh4 


K prs 1) p3k pts 4-1) 


apr k, 


This proves the theorem. 


85 


(34) 
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Remark 3.1 Note that in all the above inequalities, the implied constants only de- 
pend on k. 


The following corollary clearly follows from the above theorem. 
Corollary 3.10 For any k > 1, there exists g > 1 such that 


fl; 


P 
y eonixta 
x=0 


& 
da < c(k) P&-*, 


where c(k) is a constant depending on k. In fact, we can take g = 8*—!. 


We are now ready to prove Theorem 3.5. 
Let k > 1 and g be as in Corollary 3.10. We have 


P 8 P 
Sere = > e2tilat tag te txg oe 
x=0 


X1,XQ,.--,Xg=0 


2mima 
= ) Cmeé ; 


m>0 


where 


i! P & 
_ _ Qmixka —2rima d 
Cm =e k(m) = e e a 
0 x=0 


By Corollary 3.10, we deduce 


1 
Cm S / 
0 


P . &§ 
Soe" | descayre 


x=0 


Choosing 


we get 
ree(m) <c(k)mE—!, 


which proves Theorem 3.5. 

It is possible to derive an asymptotic formula using the circle method for the 
number of such representations, provided that g is large enough. Essentially, all the 
ingredients have been proved above with one exception, and this is the celebrated 
Weyl’s inequality for exponential sums. Using an ingenious differencing technique, 
Wey] derived non-trivial estimates for exponential sums of the form 


S:= > eer). 


n<x 
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where P(n) is a polynomial. The idea is simple to explain. For P(n) a monic linear 
polynomial in n, the sum is a geometric series and can easily be summed. The 
estimate depends on how close qa is to an integer. When P(n) is quadratic, then 
considering |S|?, we get an exponential sum 


ecTia(P(n)—P(m)) 


N,M<x 


Using the technique that already appeared in Hua’s theorem, we can reduce this to 
the linear case and derive a non-trivial estimate for S. It is now clear that a delicate 
inductive process will lead to non-trivial estimates for S when P is any monic poly- 
nomial of arbitrary degree. Such an estimate was employed by Hardy and Littlewood 
in their treatment of Waring’s problem. They were able to derive an asymptotic for- 
mula for rg ;(m) when g > 2‘ + 1. More precisely, they showed that 


d+ 1/k)s 
where 
6(n) = > T,(n), 
q=1 
and 


q q g 
Ty (n) = > q 8 » grt eo 2tian/q. 
a=1 r=1 


(4,q)=1 
The series G(n) is called the singular series and measures the “local obstruction” to 
the problem of counting the number of solutions. Remarkably, 7; (7) is a multiplica- 
tive function of g, and one can write the series as an infinite product from which its 
positivity can easily be inferred. We refer the reader to two excellent surveys on the 
topic for further study [47] and [197]. As can be seen from the last reference, there 
is still ongoing research in the development of the method. 


4 Goldbach’s Conjecture 


In 1742, C. Goldbach conjectured that every even number greater than 2 can be 
written as a sum of two prime numbers. Consequently, this predicts that every odd 
number > 5 can be written as a sum of three primes. The binary Goldbach con- 
jecture is still open. However, the Goldbach conjecture for odd numbers is now 
a theorem due to Vinogradov. His theorem is a good illustration of how one may 
combine elementary methods with sieve techniques to prove deep theorems. It was 
originally proved by Hardy and Littlewood by applying the circle method and using 
the generalized Riemann hypothesis. We indicate how this was done below. It is not 
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our intention here to give an exhaustive treatment of the method. Rather, we shall 
illustrate its use in the ternary Goldbach problem. 
The idea can be explained as follows. Let 


Fem] > eve), ~eQee™, 


psn 


where the summation is over prime numbers. Let r(7) be the number of ways of 
writing 7 as a sum of three primes. Then, clearly, 


1 
rin)= f>(0,n)e(—nd) dé. 
0 


As explained earlier, we need to have some information on f (0,7). When @ is a 
rational number, say, 06 = a/q, we may relate the function to primes in arithmetic 
progressions as follows: 


a 
p(4.n)= > e(ba/qirrin,q,b) + O(ogq) 
q p=! 


since there are at most O(log gq) prime divisors of g. By the generalized Riemann 
hypothesis, 


li(n) 1/2 
x(n, q,b) = —~ + O(n'/? loggn), 
(9) ( ) 
where 
" dt 
li(n) = —— 
2 logt 
Now, the sum 
q 
> e(ba/q) 
b=1 
(b,q)=1 


is a Ramanujan sum and, as a is coprime to q, is equal to j4(g) where jz denotes the 
Mobius function. 
Inserting this into the above expression for f(a/q,n) gives 


(0) = BB. fe 


Thus, the GRH provides us a good approximation of f(6,n) when 6 = a/q and q 
is not too large. Since the function f (6,7) is a continuous function of 6, one may 
apply partial summation to deduce its behaviour in a suitable neighborhood of a/q. 
Thus the strategy in the circle method is to decompose the interval into “major arcs” 
suitably controlled and the set of “minor arcs,” which is just the complementary 
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set. LM. Vinogradov, in his proof of the ternary Goldbach Conjecture, used the ele- 
mentary sieve of Eratosthenes to estimate f(0,) for 6 belonging to a “minor arc.” 
Putting those two estimates together, he was able to obtain the desired asymptotic 
formula for r(n). The use of the GRH can be replaced by the Siegel—Walfisz the- 
orem, which is what Vinogradov did, and thus, the whole derivation can be made 
unconditional. As we indicate below, the method breaks down for the binary Gold- 
bach problem. 


4.1 Basic Lemmas 


We recall, for the benefit of the reader, two elementary results that will be used 
repeatedly in the proof below. The first is a classical lemma of Dirichlet: given any 
positive real number 6 and a rational Q > 1, there exist (a,q) = 1 with g < Q so 
that 


1 
ge =als-—. 
Q 
Indeed, consider the Q + 1 numbers {n6}, 0 < m < Q. If we subdivide the unit 
interval into Q equal parts each of length 1/Q, we see that two of these numbers 
must lie in the same subinterval. Thus, 


\(on 18} — {m2} <5 


form, ~mz2 < Q. Hence, 
1 
as 
Q 


Setting g =m, — m2 and a = [m6] — [m26] gives the result. 
The second elementary result is Abel’s lemma. If 


B(m) = )° dj, 


j<m 


|(m, — m2)6 — ([m 16] — [m26])| 


then, 


\ > ajbj = Y- aj(BU) — BG - D) 


j<m j<m 
=) oajBG)— >) aj BG) 
j<m j<m-1 


= S) (aj — 4j41)B(j) + am Bm). 


jm 
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Let us also recall the continuous version of partial summation: 


Y> f(b) = Bln) f On) — i BO) f' (td. 


j<m 


4.2 Major Arcs 


As indicated in the previous sections, we have 


1 
rin)= f>(0,n)e(—n6) dd. 
0 


Since the integrand has period 1, we have 


1 


y—i 
riny= f * £36, n)e(—n6) dé 


Q 


for any QO > 0. 

Now let (a,g) =1,a<4q,q <log°n, Q =n/(log‘ n), where c is a constant to 
be chosen. 

Define J (a, g) as the set of points 0 such that 


1 
a, 


~ Q 


Observe that no two distinct intervals I (a, q) overlap for if 6 € I(a, qg) and I (a’,q'), 


then because aq’ 4 a’q, we have 
/ 
2 
gla) aa aro |r 
q q Q 


1 
which implies Q < 2qq' < 2log*° n, a contradiction for n large enough. The major 


94 
arcs are by the definition 
m= UY U lao. 
qslog’ n (a,qy=1 


a 
I 
q 


a a 


q 4 


,— 


The minor arcs are [0, 1]\99t, or more accurately 


-—1 1 
=|—,1—-— — |\St. 
E ah 


We then have 
r(n)= / fO, n)e(—n@) dé +f PPO, n)e(—né) dé, 
mM m 


and we analyze each of these intervals separately. 


4 Goldbach’s Conjecture 
4.3 Application of Partial Summation 


Let us begin with the observation that for any trigonometric sum of the form 


S(0,n) = > dme(mé), 


m<n 


we have by partial summation 

S(O; + 62,n) = S(O), n)e(@2n) — 27162 i S(0,, u)e(Ou) du. 
Applying this to f(@,) gives 

f(@i + 2, n) = e(O2n) f (01, n) — 277i 62 i’ f(A, uwe(O2u) du. 


We also introduce the function 


1 
gO,n)= >> ae 


ms<n 


e(m@) 


and find 


n 
g(O1 + 62, n) = e(62n)g(01,n) — 2zion | g(O1, u)e(O2u) du. 
0 


4.4 Primes in Arithmetic Progressions 


Applying the Siegel—Walfisz theorem, we find when 0 = a/q, 


a L(q),. n 
—,n)=—"1 
($-") $(q) im +0(—-). 


where we have used the fact that 


q 


u(q)= >> e(ba/q), 
Co 
a familiar Ramanujan sum. Let us observe that 


0.2) = i seo (e 
n)= = 
a7 logm Jo logt logn 


m<n 


=li(n) + o( : ) 
logn 
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Thus, 


By partial summation, 


a a n a 
(é +- y.n) = coms (4.n) - aniy f (4. 1 Jelou)au 
q qd 0 q 


and 
n 
g(y,n) =e(yn)g(0,n) — aniy f g(0, ue(yu) du, 
so that 
a (q) 
(< +y.n) —-——a(y, o| 
| q $(q) 
n "| (a L(q) 
< +2niyi f"|r(S.u) - 22 (0, u)| au 
log’ n ae, q $(q)° 
n . log4 n 
Kaa if |y| < ——. 
log“ n n 
Since 
|e? — w?| = |z— w||z7 + w? +z, 
we immediately deduce 
a LQ) 3 3n3 
f(S+yn)- She (y,n)} < (*) 
| q o(q)° log’ n 


for |y| < (log n)/n. 


4.5 The Singular Series 
We can now write down the main term of the asymptotic formula. Let us put 
J(a,q) =) f> (0, n)e(—n6) de 
T(a,q) 


and 


1/Q ‘ 
n= {| Si wemay: 
—1/Q 
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so that the inequality of the previous section leads to 


(q) na 3n> log’n 3n2 
yan a he( ) < a 2 < oe (+) 
$°(q) q log4n n log4~¢n 


We can choose A as large as we please. If we put 


p(n) = iS (logm)~' (log)! ogm3)', 


mi +m2+m3=n,mj>2 


then 
1/2 
p(n) = [8 omen) dy. 


Let us note that 


2 
= < p(n) <n’, 
3 log? n , 
as is easily verified. Moreover, 
1 
> e(my)) << — 
ome ly 


as is also easily deduced since the sum is a geometric series. Thus, by Abel’s lemma, 


_yremy) 1 
siy.n= >> een er 


m<n 


Thus, 


n2 


2c py" 


1/2 
i g°(y,n)e(—ny) ay KO? «K 
1/Q log 


A similar estimate holds for the integral over the range [— 5 =o: Thus, by («*), 


1/Q A n2 
p= | g (y.me(-ny) dy + O( ae ) 
-1/Q log n 


Hence, from (*) we immediately see that 


L(q) na n? 
yon pg” nel \<gen 


We denote the Ramanujan sum 
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so that 
H(q) 
dd, J@a-e@) D7 Gye) 
q<log’n (a,q)=1 q<log’ n 
n2 5 2 
loon ae — 
log4-¢n r log4-*¢ n 
Thus, 
f 6n) do u@) (me 
ea ,nje(—0n) dd = pin) >> Bett Ol cua, 
q<log’ n 


and the sum on the right-hand side is called the singular series. It is not difficult to 
see that 


L(q) va H@ 1 
De gage = a Ne aie jo 


qxlog®n 


and using the fact that c, (7) is multiplicative, we find that the series is 


nn) 
; Qa? 
When n is odd, this series is 
Cp(n) = 
2 ecm Sas Be (1-— a) (- =)= i, 
No-% aE NG- Gag)? 


Thus, in particular, the series is not zero. If we can show 


3 i n* 
f3(0,nye(—nb) do) « 
m lo 


pa 


we can conclude that 


(n) n2 
ro) = em] (1-22, ) +o( ) 
z (p=? log*tn 


and that 


n 
r(n) > 3 
log? n 


completing the proof. 
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4.6 The Minor Arcs Estimate Using GRH 
We begin by noting that 


1 
if f> (0, n)e(—n6) dO| < sup |f0.m)| f | £70, n)| d0 < x(n) sup | f (0, n)| 
m dem 0 dem 


by Parseval’s formula. Following Hardy and Littlewood, we want to indicate how 
the generalized Riemann hypothesis (GRH) can be used to supply such an estimate 
in a quick way. 

Let us observe that (as before) 


r(<.n) =" e(pa/q) 


psn 


= ye e(ba/q)m(n, q, b) 


(b,q)=1 
= ED i+ YO etba/are (nab), 
(b,q)=1 
where 
OUD Sn De 
RG ee ee a 
Now, 
E(n,q,b) = 5} 5 Y= Xb)x(n. x), 
X#XO 


where the sum is over non-trivial characters x (mod q). Thus, 


Yo e(ba/q)E(n,q,b) = xO 5 DL, Klay a(n, x), 


(b,q)=1 X#XO 


where t(x) denotes the Gauss sum attached to x. The GRH is equivalent to the 
estimate 


|x(n, x)| «nil? logqn, 


so that we get 


*(é. n) = ee + O(q'?n'/? logqn). 


With these remarks, we may proceed to estimate f(@,n) when @ is an element of 
the minor arcs. We observe that Dirichlet’s lemma gives that there are g, a such that 
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|gO —a| <1/Q. As 0 €m, we must have g > log n. Thus, 


| a 1 1 
g—-|<—-<- 
q| qQ-sn 


Let y = 6 — a/q. Then, by partial summation, 


rom=s(* +y.n) =eoms(S.n) ~aniy f° f(S.u)eourdu 
q q 0’ \q 


so that 


n 
log?! n 


Choosing c sufficiently large gives us the desired estimate for the minor arcs. 

The elimination of the use of GRH by Vinogradov emanates from two simple 
ideas. The first is an estimate for a trigonometric sum. The second is a clever ap- 
plication of the sieve of Eratosthenes. Combining both of these ideas, Vinogradov 
showed that the estimate of the function f (0,7) on the minor arcs can be deduced 
without GRH. We refer the reader to [197] for details. 

The binary Goldbach problem seems to lie at a deeper level. One can derive the 
expected main term by analyzing the major arcs as we have done in the ternary 
Goldbach problem. However, the minor arc estimates are not good enough to yield 
the desired result. Clearly, a new idea needs to be injected into the method. 

These two examples, showing how Waring’s problem and the ternary Goldbach 
problem can be attacked using the circle method, should illustrate the power and 
scope of the method introduced by Hardy and Ramanujan in their foundational paper 
giving the asymptotics of the partition function. The method is still a sophisticated 
tool in additive analytic number theory and very much the focus of current research. 


|\f@.n)|« 


Chapter 6 
Ramanujan and Transcendence 


1 Nesterenko’s Theorems 


Although Ramanujan did not work in transcendental number theory, his intuition 
in the theory of qg-series, singular moduli, and the theory of quasi-modular forms 
played a pivotal role in its development. Indeed, his celebrated paper of 1916 entitled 
“On certain arithmetical functions” is notable for its wealth of ideas, not only for its 
enunciation of the Ramanujan conjecture regarding the t-function, but also for its 
emphasis on Eisenstein series and the differential equations satisfied by them. More 
precisely, Ramanujan considers the functions 


P(q)=1-24) oi(n)q", 


n=1 


Q(q) = 14240) o3(n)q", 


n=1 
CO 

R(q) = 1-504) \as(n)q", 
n=1 


where 


a(n) =) od*. 


d\n 


These functions are connected to the classical Eisenstein series E>(z), E4(z), F6(z) 
via the equations 


Eo(z) = P(e***), E4(z) = Q(e*"), E6(z) = R(27") 


with z € C and S(z) > 0. E4 and E¢ are modular forms of weight 4 and 6 respec- 
tively, whereas F2 is a quasi-modular form of weight 2. It is known [207] that the 
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algebra of quasi-modular forms is closed under the differential operator D = gd/dq 
and is generated by E2, E4, and E6. 
In his 1916 paper, Ramanujan shows that 


1 1 1 
(P* — Q), DO=-~(PO-—R), DR=>5(PR- 0”). 


DP=— == 
12 3 


These equations played a central role in the remarkable theorem of Nesterenko [142] 
concerning the special values of these functions. 


Theorem 1.1 (Nesterenko, 1996) For any complex number q with 0 < |q| < 1, the 
set 


{q. P(g), Q(q), R(q)} 


contains at least three numbers algebraically independent over Q. 


As Ramanujan was aware, the functions P, Q, R are related to many interesting 
elliptic and modular functions. This allows one to relate the above theorem to several 
specific constants. For instance, if we take g = e~°”, then one can show that 


ees, ¥en “SR /aP — 
i a ce aS CO 


Thus, Nesterenko’s theorem implies the following: 


Corollary 1.2 The numbers m, e”, and I'(1/4) are algebraically independent 
over Q. 


More generally, let 9 (z) be the Weierstrass so-function with invariants g2, g3. Let 
the corresponding lattice be generated by w1, w2 and denote by 71, 72 the associated 
quasi-periods. We suppose that z = w2/@ , with 3(z) > 0. By the theory of elliptic 
functions (see Chap. 4 of [107]), we have upon setting g = e?7"%, 


3171 | 


3 ; OT GN? 
P(qQ)= we o=7(2) 82, ro= 2 (2) 3. 


These formulas imply that P(g), Q(q), R(q) belong to the field Q(g2, 93, @1/Z, 
n, /7). From Nesterenko’s theorem, we have 


Corollary 1.3 Let go(z) be the Weierstrass go-function with algebraic invariants 
82, 83. Then the numbers 


e2Ti(@2/o1) | a\/m, y/m 


are algebraically independent over Q. 
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If the elliptic curve associated with ”9 (z) has complex multiplication, we deduce 
the following: 


Corollary 1.4 Let go(z) be the Weierstrass go-function with algebraic invariants 
82, 83 and whose associated elliptic curve has complex multiplication by the imag- 
inary quadratic field k. If w is any period of §(z), and n is the corresponding 
quasi-period and z € k with 3(z) # 0, then both sets 


{x, w, e?7'} and {w, n, e77"7} 


are algebraically independent over Q. 


Indeed, in the case of complex multiplication, 2, n2 are algebraic over the field 
Q(@ 1, 71) (see Chap. 3 of [122]). Using the Legendre relation 


271 — @1N2 = 271, 


we deduce that 7; is algebraic over Q(@1, 7) and that z is algebraic over Q(@, 71). 
This leads to the above corollary. 

For any natural number d, there exists a Weierstrass so-function with algebraic 
invariants and complex multiplication by Q(./—d). Thus, the previous corollary 
implies the following: 


Corollary 1.5 For any natural number d, the numbers 


nJ/d 


Tw, e 
are algebraically independent over Q. 


In fact, a stronger result can be deduced from the above. Using the Chowla— 
Selberg formula [34] and its relation to the theory of elliptic curves [61], we find the 
following: 


Corollary 1.6 For any imaginary quadratic field with discriminant —D and 
quadratic character €, the numbers 


D 
x, etd. []r@p 


a=1 


are algebraically independent over Q. 


In the special case that D = 3, we deduce, from the corollary upon using the 
functional equation [*(z) "(1 — z) =2/sinzz with z = 1/3, the following: 


Corollary 1.7 The numbers 1, e7 v3, and I(1/3) are algebraically independent. 
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2 Special Values of the '-Function at CM Points 


It is, in fact, possible to deduce further transcendence results from Nesterenko’s 
theorem, in the context of Ramanujan’s work. In 1915, Ramanujan [158] wrote a 
paper evaluating the product 


In this paper, we find some elegant formulas of the following kind: 


a ( 2a )) (1 +a) sinh(raV3) 
T](1+ = 
= P14 3a)2raJ/3 


and 


~ 1a (24! *\  F(+a)3 cosh(r(5 + a) V3) 
I ( +(42) )- r(2Q+3a)x 


n=1 
If we put w = 0 in the second formula, we deduce the strikingly beautiful evaluation 


(ee) 


1 h 3/2 
HO+5)-= (x /3/ ) 6s) 


TT 
n=1 


which is transcendental by Nesterenko’s theorem. One would expect these products 
to be transcendental for every rational aw, but this deduction can only be made (at 
present) for a limited number of rational values of a like a = 1/3, 1/4, or 1/6. Other 
products appear in Chaps. 13 and 14 of Ramanujan’s notebooks. For instance, in 


[159] we find 
wie x \\_ IF@P ee 
H( +() )-iReepe -) 


. lone) 2 

sin Zz z 
=| 1-=), 

Wz n 


n=1 


Since 


we immediately deduce from Ramanujan’s formula (36) and Nesterenko’s theorem 
the following: 


Theorem 2.1 Let a+b /—D with a € Z and b € Q be a CM point. Then, "(a + 
bs/—D) is transcendental. In particular, I (ix) is transcendental for every non-zero 
rational x. 


We refer to the forthcoming paper [138] for more applications of these ideas in 
the context of infinite products. 
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3 Special Values of Jacobi’s Theta Series 


In Part V of Bruce Berndt’s epic series on Ramanujan’s notebooks, we find 
Chapt. 35 dedicated to special values of the theta functions (in Ramanujan’s no- 
tation) 


o(q)= 4” 


and 
[o,@) 
wig=pignnye. 
n=0 


From the Jacobi triple product identity it is not hard to see that these series can be 
written as infinite products. More precisely, we have [44] 


ye, ee ee 
n(q)*n(q*) 
and 
_ nq’? 
Va = 


where 7 denotes the Dedekind 7-function. 

In his notebooks, Ramanujan evaluated many special values of g and y. All of 
his results are related to either g(e~*) or y(e~—**) for certain rational values of x. 
The striking feature of his evaluations, tabulated on p. 325 of [20], is that the first 
group ((i)-(x)) of values of g(q) are algebraic multiples of 

1/4 
oF GD 
whereas the second group of values of y(q) are algebraic multiples of ae”™/** 
where r is a natural number. 

The chapter in [20] provides no theoretical explanation for these formulas. Nor 
does it give any reason why only !/4, (3/4), and e” appear as the transcendental 
quantities. This fact could have only been deduced from the work of Nesterenko. 

We indicate briefly the theoretical rationale for these results. The detailed calcu- 
lations will be given in a forthcoming paper. For the moment, suffice it to say that 
the 24th power of g(qg) or w(q) allows us to express it in terms of Ramanujan’s 
A-function. This, in turn, can be expressed in terms of E4(z) and E¢(z), and then 
Nesterenko’s work can be invoked. 
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4 The Rogers—Ramanujan Continued Fraction 


The Rogers—Ramanujan continued fraction is 


F@= 
I+ 


1+". 


Its genesis is rooted in the celebrated Rogers-Ramanujan identities. These are re- 
markable identities expressed by the following equations: 


0° m2 oo 
‘ q 5m+1\—1 5m+4\-1 
G(q) := = 1- 1— : 
(q) ier er aera TT q ) ( q ) 
00 m(m+1) oO 
. q 5m+2\—1 5m+3\—1 
H = = 1- 1- : 
(q) 2 a-pd a da) TT q ) ( q ) 


The main feature of these identities is their striking beauty and simplicity. In 1936, 
Selberg [178] found generalizations of these identities. Closer examination of these 
identities reveals hidden modular connections. 

The Rogers—Ramanujan identities have an exotic history. Apparently, they were 
first discovered by Rogers [175] in 1894. It seems that he was a gifted mathemati- 
cian whose work was largely ignored. For instance, he discovered Holder’s inequal- 
ity before Hoélder did (see p. 100 of [67]). The identities were re-discovered by 
Ramanujan in 1913, before he had come to England. However, he had no proof and 
knew he had none. Since no one in Hardy’s circle of mathematicians could prove 
them, they were stated without proof in the second volume of MacMahon’s book 
on combinatorial analysis. In 1917, Ramanujan was looking through some old vol- 
umes of the Proceedings of the London Mathematical Society and stumbled on the 
1894 paper of Rogers. Hardy wrote, “I can remember very well his surprise, and 
the admiration which he expressed for Rogers’s work. A correspondence followed 
in the course of which Rogers was led to a considerable simplification of his orig- 
inal proof.’ Around the same time, Schur [177] rediscovered these identities and 
published two combinatorial proofs. In 1919, Ramanujan [163] published another 
(more analytic) proof. 

In his 1894 paper, Rogers [175] proved that 


A(q) 
G(q) 


Using the Rogers—Ramanujan identities, the right-hand side can be written as 


[o,e) 
I] (1 = gaye 


n=1 


q'P F(q)= 


5 Nesterenko’s Conjectures 103 


where x5(7) is the non-trivial quadratic character (mod 5) given by the Legendre 
symbol (5/7). If we put g = e*", then writing the right-hand side in terms of theta 
functions, one can show that in fact, F(e2”*) is a modular function for (5) (see, 
for example, [43]). 

The stage is now set to apply Nesterenko’s theorems to F(q). These were derived 
by Duverney, Ke. Nishioka, Ku. Nishioka, and Shiokawa [44]. The explicit relation- 
ship of this function to modular functions is given by the following equation: 


1 n(q'/?) 
—— =F(q)+1+q"° 
F(q) w 71@ 


where 77 is the Dedekind 7-function. In [44], it was shown that Nesterenko’s theorem 
implies the following: 


Theorem 4.1 F(a) is transcendental for any algebraic a with 0 < |a| <1. 


In Ramanujan’s letters to Hardy, we find several explicit evaluations of F. For 
instance, in his first letter, Ramanujan wrote 


F(e") = [5 zee 1 ned 


He further wrote that F(e~” Vv") can be determined exactly for any natural number 
n. From the modular perspective, this is now clear since F being a modular function 
means that it is a rational function in j(z) and j(5z), where j is the j-function. 
Ramanujan was evaluating the function F at CM points in the upper half-plane, and 
for such values, the j-function assumes algebraic values by a classical theorem of 
Schneider (see, for example, [135]). 


5 Nesterenko’s Conjectures 


Nesterenko conjectures that if z € C, 3(z) > 0, and g = e?7% 


true. If the set 


, then the following is 


{z.9, P(q), Q(q), R@)} 


contains at most three algebraically independent numbers over Q, then z lies in an 
imaginary quadratic field, and the numbers 


q, Pq), O(@) 


are algebraically independent. 

The work of Nesterenko allows us to study transcendental values of modular 
forms at algebraic arguments. An example of this is illustrated by the following 
theorem proved by Gun, Murty, and Rath [64]. To state the theorem, we say that 
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two modular forms f, g are equivalent if there are two natural numbers k;, k2 such 
that f* =Ag", with A algebraic. 


Theorem 5.1 Let a be anon-CM point in the upper half-plane. Let Sy be the set of 
non-zero modular forms with algebraic coefficients, of arbitrary weight for the full 
modular group, for which f (a) is algebraic. Then up to equivalence, Sy has at most 


one element. 


If we assume Nesterenko’s conjecture, then there is no exception, and the set Sy 
is empty. 


6 Special Values of the Riemann Zeta Function and g-Analogues 


In Entry 21(1) of Chap. 14 in Ramanujan’s second notebook, we find the following 
remarkable formula. If a, 8 are positive numbers such that a6 = 2’, and if r is a 
positive integer, then 


fl = 1 


offi . : 
— (—46) (550° +1)+ 3s ae) 


1 is 
en (—1)* Boy Bor 49-2p00" +1 pt 
QH1Qr+2—-%! 


k=0 


where the B;s are the Bernoulli numbers. (See pp. 275—276 of [19].) Apart from its 
intrinsic beauty, this formula is interesting for a variety of reasons. We know that the 
special value of the Riemann zeta function ¢(s) for s = 2k is a rational multiple of 
mk If 5 = 2k + 1 is odd, the transcendence of ¢(2k + 1) has not been established. 
Ramanujan’s formula expresses these values as a rational multiple of 27+! plus an 
additional term which can be interpreted as an Eichler integral. The transcendence 
of this Eichler integral is studied in a recent paper of Gun, Murty, and Rath [65]. We 
elaborate on this below. 

Apparently, the formula also appears in Ramanujan’s lost notebooks, and from 
this we can surmise that it was discovered before Ramanujan’s arrival in England 
in 1914. In [22], Berndt relates that several mathematicians re-discovered special 
cases of this formula, the most notable being Emil Grosswald [62], who extended 
its range of applicability to the upper half-plane. For instance, in the special case 
r = 1, we get 


= 1 In? 
3)+2 — : 
sees d n3(e27" —]) 180 
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Thus, at least one of the terms on the left-hand side is transcendental! Similarly, one 
can deduce that at least one of 


lee) 
1 
5 (4k ef 3), d nak+3 (e2nn = 1) 


is transcendental for every integer k > 0. These identities suggest we consider the 
function 


[o,e) 
Fi (2) =) o-a(nye""”. 
n=1 


One can re-write this as 


~ 1 
o(k) d Seer 


For odd negative values of k < —3, Fx(z) is essentially the Eisenstein series Ej; (z) 
(apart from the constant term) of weight 1 — k for the full modular group. For odd 
positive values of k, one can view Fx (z) as an Eichler integral, which is obtained by 
integrating a classical modular form. Indeed, noting that 


o_x(n) =n *ox(n), 


we may re-write F;,(z) as 


[o,@) 
oh ox(n) 2minz 
7 ‘ 
n 
n=1 
It is now clear that F(z) can be obtained by successive integrations of the classical 
Eisenstein series minus its constant term. Fx(z) is not quite a modular form, and it 
seems natural to study what happens to it under modular transformations. Thus, for 
any given k > 4, we can look at 


Foes (Z) — 278 Fog (—1/2z) 


and study the special values of this function at algebraic points. The reason for doing 
this is clear since this expression is, by the Ramanujan—Grosswald formula, 


-\2k+1 kK+1 ; ; 
(271) 2k+2—2j Boj BK 42-2; 


2z _ (fk +2—2))! 


1 
55 (2k + 1)(2* — 1) + 


If for some value of z which is not a 2kth root of unity, we have a zero of the 
polynomial appearing in the summation, we can then write ¢(2k + 1) as a difference 
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of two Eichler integrals. Indeed, the zeros of the polynomial 


k+1 


_9; Bo; Box42-2; 
RRH@ = nee a aa 
= (2j)\(2k +2 —2))! 


called the Ramanujan polynomial (of degree 2k + 2) by the authors in [137], has 
been studied by Murty, Smyth, and Wang. For k > 4, they showed that all the zeros 
of Rox+1(z) are simple and lie on the unit circle, apart from four real roots. More- 
over, the only possible roots of unity which are zeros of R2x+41(z) are +i (which 
happens if and only if k is even) and +p, +p? with p = e*”'/3 (which happens if 
and only if 3 divides k). As a consequence, they deduce that for k > 4, there is an 
algebraic a@ with || = 1 which is not a 2kth root of unity so that 


(Fox-1 (a) — o7* Foe (—1/a)). 


2 
2k+1)= 
(Qk +1) =a 


The authors in [65] were led to consider the function 


1 2k 
Goe4i(z) = aE (Fat @) — 2" Fox41(—1/2)) 
since a special value of this function is ¢(2k + 1). They prove the following: 


Theorem 6.1 The set 


{Gar¢1(z)  3(z) > 0,2 € Q, 2* £1} 


contains at most one algebraic number. 


It is clear from these results that Eichler integrals will play a prominent role in 
our understanding of the special values of ¢(2k + 1). 

Apéry’s surprising result that ¢(3) is irrational proved in 1977 seemed to be an 
isolated result. All attempts to generalize it to ¢(5) and more generally to ¢(2k + 1) 
met with limited success. Only recently did Rivoal [173] succeed in showing that 
infinitely many of ¢(2k + 1) are irrational. For instance, we now know that the 
dimension of the vector space spanned by the values (3), ¢(5),...,¢(2a + 1) is 
greater than c log a for some positive constant c. These results emanate from a subtle 
study of hypergeometric series along with an irrationality criterion of Nesterenko. 
The theory of g-series and its interconnection to the theory of modular forms and 
mock modular forms seem to suggest a new paradigm for research in this context. 
Indeed, new light is shed on the irrationality questions by studying the g-analogues 
of the Riemann zeta function. This analog is not unrelated to the functions P,Q, R 
of Ramanujan. It seems natural to define the g-Riemann zeta functions 


© Ts-lan 


t(s)= 


n=1 
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and try to study irrationality and transcendence of these values for specializations of 
q and s. There are good theoretical reasons for doing so. Indeed, ¢, (2), 4 (4), Sq (6) 
are essentially the same as Ramanujan’s P, Q, R respectively. Nesterenko’s theorem 
gives us precise information on transcendence of these values, that is, ¢,(s) when 
s is even, since from the theory of modular forms, ¢, (2k) is a polynomial in ¢, (4) 
and ¢, (6) for k = 2. What exactly is the situation with ¢, (2k + 1)? 

Recently, Rivoal and others have obtained interesting results in this direction. 
For instance, Krattenthaler, Rivoal, and Zudilin [101] have shown that the Q-vector 
space spanned by the values ¢(1), ¢,(3),...,¢q(@) has dimension at least ciJa 
for some positive constant c;, provided that g is different from +1 and a is an odd 
integer. 

The irrationality of ¢,(1) with q algebraic is connected with the study of the q- 
exponential function and the g-logarithm. The general philosophy of transition from 
the world of natural numbers to the world of g-numbers is to relate n with gq” — 1. 
Thus, the analog of the g-exponential function is 


x” 


at GG yea 


and the q-logarithm is 


lee) xn 
L — 
(OO Dey 
n=1 
It is not difficult to show that 
bas x 
ég(x) = H( 4: a) 
n=1 a 
and 
= x 
£ = ; 
g(x) eer 


One can relate the value of ¢, (1) to the special value of a g-logarithm. Even for the 
q-exponential function, one needs the strength of Nesterenko’s theorem to assert that 
€q (1) is transcendental for algebraic values of g with |g| > 1. It will be interesting 
to study and develop transcendental number theory to the domain of its g-analogs. 

From these results it is clear that this “quantum” world seems to lie deeper than 
the world of natural numbers. This line of investigation may ultimately shed new 
light on our understanding of the special values of the Riemann zeta function at odd 
arguments. 


Chapter 7 
Arithmetic of the Partition Function 


Ramanujan’s work on the partition function can be divided into two parts. The first 
part deals with its arithmetic properties, congruences and identities, now called the 
Rogers—Ramanujan identities. The second deals with his joint work with Hardy 
on its asymptotic behaviour, for which the elaborate circle method was developed. 
Since we discussed the latter method in an earlier chapter, we focus here on the 
arithmetic side of the partition function. As will be evident, elliptic functions and 
the modern theory of £-adic representations play a major role in the exposition. 


1 Ramanujan’s Congruences 


Recall that a partition of a natural number n is the decomposition of n as a finite 
number of natural numbers. For example, the five partitions of 4 are given by 


4=341=242=2414+1=14+14+1+41, 


so we may think of the summands as being arranged in decreasing order. To the 
partition n =A, +A2+---+A, with A, > Az >--- >A, we may associate a diagram 
(often called the Young diagram) which has r rows and in the jth row, we place A; 
empty boxes. The number of partitions of n will be denoted by p(n). 

At the time of Ramanujan, very little was known about the arithmetic nature 
of p(n). Even today, though we have a better understanding, there are still many 
unanswered questions regarding it. For instance, it is still unknown if p() is even 
“half” the time. More precisely, is it true that 

. #{n<x:p(n)even} 1 
lim =.) 
Xx>CO Xx 2 


The conjecture is “yes”. Looking at values of p(n), one would guess that perhaps 
p(n) is “random” and no special residue class is favored. Yet, Ramanujan found 
some surprising congruences: 


p(5n+4)=0 (mod5), 
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p(in+5)=0 (mod7), 
p(iin+6)=0 (mod 11). 


Surprisingly, one needs the theory of elliptic functions to prove these results. In 
paper 30 of his Collected Papers, these results were published posthumously in 1921 
in Mathematische Zeitschrift, prepared for the journal by G.H. Hardy and based on 
an unpublished manuscript of Ramanujan. 

What is interesting in Ramanujan’s paper is that he actually proves more general 
results and the congruences stated above are corollaries of these general results. 
However, here we will follow an exposition of Berndt [21] that gives a simplified 
treatment. 

As in Ramanujan [165], we have the Eisenstein series 


oo neg” 
Q= ee ear 
n= 


0° nog” 
BN ig 


If D denotes the differential operator gd/dq, then these Eisenstein series satisfy 


2 2 
P mee pou == peat tae: 
12 3 2 
as was shown by Ramanujan in his celebrated paper [161] of 1916. 

The easiest way of establishing the validity of these differential equations is as 
follows. If we let q = e?”!%, then Q and R are modular forms of weight 4 and 6 re- 
spectively. P is not a modular form but a quasi-modular form and has the following 
transformation law. Let E2(z) = P(e?!) and G2(z) = 2¢ (2) E2(z). Then, we have 


DP= 


Goz+)=Go(z+1), — Ga(-1/z) = z*Ga(z) — 2miz. 
Then, it is straightforward to check that 
DP —(P?—Q)/12 


is a modular form of weight 4 which vanishes at infinity. Consequently, it is a cusp 
form of weight 4. But there are no non-trivial cusp forms of weight 4, and so this is 
identically zero. The other differential equations are established similarly. 

In addition to this, we have the well-known discriminant relation 


3 _ R2 oo 
o = A@=a] [Ci — qr)". 


n=1 


1 Ramanujan’s Congruences 111 


Following Ramanujan’s notation, we will denote by J any qg-series with integer 
coefficients and emphasize that J is not necessarily the same at each occurrence. 
Since we are interested in congruences, this is a useful device (similar to the big ‘O’ 
notation of Landau). We begin with the simplest congruence. 


Theorem 1.1 
pOm+4)=0 (mod5). 


Proof Using our notation, we have 


Q=14+5J, R=P+5J, 


the latter coming from n> 


theorem. We have 


=n (mod 5), which is a special case of Fermat’s little 


oO? — R*=Q(14+5J) — (P45) =O- PP 4+5J. 
From the differential equations satisfied by P we see that this is 
=-12DP+5J=3DP+5J. 


On the other hand, 


Be ash oo 7 (le ay" 
Q? — R* = 1728q | [(1-4") “= aa] Oe ay Su: 
n=1 


Using the fact that 


we deduce that 


0° R= a4( Sma") [TA qo) 45J. 


n=0 


Combining this with our earlier formula for Q3 — P*, we obtain 
lee) lee) 
3DP =3q » po [[@-47") (mod 5). (37) 
n=0 n=1 


To deduce the desired result, we need only make the observation that if J is any 
integral g-series given by 


[o,@) 
J= J a(n)q", 
n=0 
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then 


CO 
DJ= Sy“ na(n)q”, 


n=1 


so that the nth coefficient is always divisible by n. In (37), we have 


DP | [(i-q>") tsa pe (mod 5), 


and we compare the coefficients of g*” on both sides of the congruence to deduce 


the desired result. Indeed, since all the coefficients of g>” of DP are divisible by 5, 
the same is true of the product on the left-hand side which is an integral q-series. 
So the same must be true of the right-hand side. But the coefficient of g*” on the 
right-hand side is precisely p(S5m — 1). 


The other congruences are obtained similarly. One needs to use some further 
identities of the following kind. 

Since the q-series of any modular form of weight > 4 and level | can be written 
as a polynomial in Q and R, it is not surprising that we also have 


q" 


Q? = 14480) 


n=1 


(38) 


OR=1 ey 
n=1 
and 


nilgn 
fT 


44103 + 250R? = 691+ 65520) 


n=1 


the right-hand sides being the q-series of Eisenstein series of weight 8, 10, and 12 
respectively. 

The congruences for 7 and 11 are progressively harder. For the congruence mod- 
ulo 7, only the identity for Q is needed. For the congruence modulo 11, the other 
two identities are essential. We proceed with the easier case and follow [21] in our 
discussion filling in some details. 


Theorem 1.2 
p(Jm+5)=0 (mod7). 


Proof In our notation, we have 


R=1+7/, 
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and using Fermat’s little theorem (with p = 7), from (38) we have 
OQ? =P+T7J. 


Hence, modulo 7, we have 


(O° — R*)” =(PQ-1)=P?Q*-2P0+1 
= P?-2PQ+1=P(P*—Q)—-PQ+R. 
Using the differential equations satisfied by P, Q, R, we obtain that 
(O° — R*)"=5P(DP)—3DQ _ (mod_7). 
This is the same as (modulo 7) 
6DP? —3DQ. 
On the other hand, 
(03 - R*)? =@?J] (1-a") alters 
n=1 
Reducing this modulo 7, we obtain 
00 00 
(9° — R’)’= ey pone) TT [[G@-4°") (mod 7). 
n=0 n=l 
Combining this with our earlier congruence, we obtain 


dP? d 
2 ey pin)q" = Ta- gy (64 — 3q Z) (mod 7). 


n=1 


Noting again that for any g-series J with integer coefficients, the nth coefficient of 
DJ is always divisible by n, we need only compare the coefficients of q’” of both 
sides of this congruence to deduce the desired result. 


The key point to note here is that all of these congruences are emanating from 
Bernoulli numbers appearing in the appropriate Eisenstein series. This again con- 
nects naturally to the theory of ¢-adic representations and the Serre—Swinnerton- 
Dyer theory of congruences. 


2 Higher Congruences 


Based on very scanty numerical evidence, Ramanujan formulated a more general 
conjecture [164] modulo powers of primes 5, 7, and 11. In 1930, Chowla [33] and 
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Gupta (independently) found a counterexample to his conjecture modulo 7’, so that 
some correction was needed. The precise version is given by the following: 


Theorem 2.1 Let p be 5, 7, or 11, and let 5p.% be the multiplicative inverse of 
24 (mod p*). Then, 


(1) p'm + 85,4) = 0 (mod 5); 
(2) p(7km + 67,4) = 0 (mod 7*/21+1), 
(3) p(.1km + 811,~) =0 (mod 11*). 


In 1938, Watson [199] proved (1) and (2). Atkin [11] proved (3) in 1967. What 
do these proofs involve? What are the conceptual reasons behind them? Folsom, 
Kent, and Ono [53] used the theory of £-adic modular forms to give a conceptual 
explanation of these congruences. We refer the reader to this paper for further de- 
tails. 


3 Dyson’s Ranks and Cranks 


In 1944, Dyson made an important observation to give combinatorial explanations 
for these congruences. If (A;,...,A,) is a partition 4 of n (denoted A 1 7), he de- 
fined the length of 4, denoted £(A), to be r and the rank of 4, denoted r(A), to be 
A, — £(A). For instance, the partition of 4 given by (2, 1, 1) has length 3 and rank 
—1. Dyson considered the functions 


R(n,m, M) =#{A En: r(A) =m (mod M)}. 
He conjectured that 
pOn+ 4) =5R(5n4+4,m, 5) 


for every 0 < m <4 and 
p(/n+5)=7R(n+5,m,7) 


for every 0 < m < 6. In 1954, using the theory of modular functions, Atkin and 
Swinnerton-Dyer [13] proved Dyson’s conjecture. Unfortunately, Dyson’s rank 
function did not apply to Ramanujan’s congruence modulo 11. So he was motivated 
to conjecture that there is another function, which he called the “crank” function, 
that would explain combinatorially Ramanujan’s congruence (mod 11). In 1988, 
Andrews and Garvan [8, 56] found Dyson’s “crank” function. It is defined as fol- 
lows. Given a partition A of n, let o(A) be the number of 1s in A, and jz(A) be the 
number of parts larger than o(A). The crank of 4, denoted c(A), is given by A, if 
o(A) = 0 and w(A) — o(A) otherwise. It turns out that the crank function gives a 
combinatorial explanation for Ramanujan’s congruences (mod 5) and (mod 7) also. 
Indeed, define 


S(n,m, M) =#{r En:c(A) =m (mod M)}. 
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Theorem 3.1 
(1) For eachO0 <m <4, 

pn +4) =5S(5n +4, m, 5). 
(2) For eachO0<m <6, 

p(In+5) =7S(7n + 5,m,7). 
(3) For eachO <m < 10, 


p(In +6) = 11S(11n + 6, m, 11). 


There has been further work of Garvan, Kim, and Stanton [57] in this con- 
text. Ramanujan expected that there are no such simple congruences for primes 
greater than 11. In searching for a theoretical explanation for Ramanujan’s intuition, 
Ahlgren and Boylan [3] in 2003 proved the following theorem. 


Theorem 3.2 /f € is a prime and 0 < B < £ for which 
p(én+B)=0 (mod é) 
for every n> 1, then 
(€,B) € {(5, 4), (7,5), (11, 6}. 
A nice exposition is given in [151]. Thus, Ramanujan’s congruences for 5, 7, and 
11 are the only “simple” congruences for p(7). 
Atkin, Klover, Lovejoy, and Ono have done further work extending this result to 


prime powers. Surprisingly, these extensions involve modular forms of half-integral 
weight. More precisely, let 


lee) 
F(z) =n (242)? =) ang” =q"? — 19g + 152q°" + ---, 


n=1 
[o,@) 

G(z) = (242) = Do b(n)q" = 478 — 23g"7 + 23097 +--+. 
n=1 


If x12 is the quadratic character given by the Kronecker symbol (12/-), then F is a 
cusp form of weight 19/2 for [(576) and character x12. G is a modular form of 
weight 23/2 for the same group and character. We then have 


p(5!n + 65, ;) =3/~'5/a(24n+ 19) (mod 5/*') 
for j odd and 
p(5/n +5, ;) =3/~'5/b(24n +23) (mod 5/t!), 


for j even. These results open a new line of research. 
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4 Parity Questions 


The question of determining the parity of the partition function p() was raised by 
Ramanujan (see [24]), but to date, has not been satisfactorily answered. In 1959, 
Kolberg [100] proved that p(n) is even infinitely often and odd infinitely often. 
Since his proof is elementary and short, we give it here. We begin with Euler’s 
pentagonal number theorem: 


[o,e) [o,e) 


[[@ i q") = = C1 gen De: 


n=1 n=—0Oo 
Since 


[]@-4")' =o pana", 


n=1 n=0 


we deduce the recursion (Euler’s identity) for n > 1, 


p(n) — p(n—1)— pa—2)+ pa—5)+---=9, 


where the general term is given by 


(—1)* p(n — kk + 1)/2). 


If p(n) is even for all n > a, then with n = a(3a — 1)/2, the above identity gives 


p(aGa — 1)/2) — p(aGa -)/2- 1) —---+ p(2a— 1) + p(O) =0, 


giving a contradiction. Hence p(n) takes odd values infinitely often. 

A similar argument works for even values. Indeed, suppose that p(n) is odd for 
all n > b. Taking n = b(3b + 1)/2 in Euler’s identity, we again deduce a contradic- 
tion since the left-hand side has an odd number of odd terms. 

A mild variation of this argument using Euler’s identity leads to the following 
result. For any given M > | and r arbitrary, there are infinitely many 7 such that 
p(n) #r (mod M). A general conjecture was formulated by M. Newman. New- 
man’s conjecture [144] predicts that for any given r and M, there are infinitely 
many n such that p(n) =r (mod M). In his paper, Kolberg uses Ramanujan’s con- 
gruences to prove some special cases of this conjecture. Namely, he shows that each 
of the congruences p(n) = 0 (mod 10), p(n) = 5 (mod 10), p(n) = 0 (mod 14), and 
p(n) =7 (mod 14) has infinitely many solutions. 

One can try to derive some quantitative results in this context. The Parkin—Shanks 
[153]conjecture predicts that p(n) is even “half” the time. As stated earlier, this is 
still open. There are partial results in this direction. In [148], the authors refine the 
Kolberg argument and essentially using the pigeonhole principle show that 


#{n <x: p(n) even} > JX. 
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They also count the number of 7 < x for which p(n) is odd. But in this case, they 
get slightly weaker results of the form >> x!/?-€ for any € > 0. This was recently 
improved by Ahlgren [2] and Ahlgren and Boylan [4]. In [3], Ahlgren and Boylan 
show that Newman’s conjecture is true for every prime modulus M, with the pos- 
sible exception of M = 3. In addition, they obtain some quantitative results in this 
direction. 

Subbarao [192] proposed the following generalization. If A, B are integers with 
0 < B <A, there are infinitely many n for which p(An + B) is even and infinitely 
many n for which p(An + B) is odd. Over the decades, several special cases of this 
conjecture were established. For instance, Hirschhorn and Subbarao [81] proved the 
conjecture for A = 16. 

Subbarao’s conjecture is practically proved. The “odd” part remains open. More 
precisely, in a beautiful paper, using the theory of modular forms, Ono [150] has 
shown that there are infinitely many n such that p(An + B) is even. If for some 
no, we have that p(Ano + B) is odd, then there are infinitely many n such that 
p(An + B) is odd. Nicolas, Ruzsa, Sarkozy, and Serre [148] have shown that the 
number of n < x such that p(An + B) is even is > ./x. In the odd case, Ahlgren [2] 
has shown that the number of n < x for which p(An + B) is odd is > /x/logx. 
Recently, some small improvements on these results were obtained by Ahlgren. 

There are other arithmetic questions one may ask concerning p(n). We refer the 
reader to an excellent survey by Adhikari and Mukhopadhay [1] for related ques- 
tions on this theme. Clearly, there is more work to be done, and what emerges from 
these recent developments is the ubiquitous modular connection in these problems. 


Chapter 8 
Some Nonlinear Identities for Divisor Functions 


1 A Quadratic Relation Amongst Divisor Functions 


Let o;(n) denote the sum of the sth powers of the divisors of n. Also, set 


05(0) = sos) 


where as usual ¢(s) denotes the Riemann zeta function. In his fundamental paper 
“On Certain Arithmetical Functions”, Ramanujan showed that there is a quadratic 
relation involving o, (7). 

We have 


VMaD= yea. 
jsx jsxdt=j 


Interchanging the summation, the right-hand side is seen to be 


So d'[x/dj=x od! 4+ (da) = 0(2't"). 
d<x d<x d<x 


On the other hand, we could also write this as 
1 


a 


t<x d<x/t t<x 


The right-hand side is seen to be 
i! r+1 
~ — +1). 
as cr+1) 
The same method will allow us to treat 
Yoo - i 
es 
and for integers x = n, we find that it is 
Pr+bDr(s+ 
Trir+s+2) 
provided that r > 0 and s > 0. 


C(r 4 Dn" tst1 
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Now let us set 
Xy,s(n) = 0, (O)os(n) + +--+ 0,(1)o5 (0). 
Then, using the fact that 
n> <o;(n) < n’(1 +2°-*4.. -) =n'C(s) 
it follows that for r > 0 and s > 0, 
Ds (1) x Trir+)r(st+)) 


lim inf sl = —Feds+2) c(r+1) 
and forr >Oands > 1, 
» rT Dr 1 
lim sup rs (”) < es oe Oe: ere DE). 


n’tstl ~  Pr+s +2) 
Ramanujan proves that whenever r and s are positive odd integers, 
rr+t+)D)roe+VDe~r4+Des64+) 
Drs (n) = Or+s+l (n) 
T(r+s+2) C(r+s+2) 
l-r+cd- 
‘ g( )+¢U—s) 
r+s 
Moreover, he shows that in a finite number of cases, there is no error term. In other 
words, the O-term above can be removed, and one has an exact identity. The values 
of (r,s) for which this occurs are given in the table below. 


2 
NOy+s5—1(n) + O(n3e+s+), 


~ 


S 


AWW Wee eee 
— 
NYONWKFAINWH 


In particular, the case r = | and s = 3 gives the identity 
01 (1)03(n) + 01 (3)03(n — 1) +01 (5)o3(n — 2) +--- +01 (2n + 1)03(0) 


1 
= —o5(2. 1). 
7407! n+1) 


2 Quadratic Relations Amongst Eisenstein Series 


Since the Fourier coefficients of Eisenstein series are given in terms of the functions 
ox (n), the relations of the previous section may be interpreted as quadratic relations 
satisfied by Eisenstein series. Using the fact that the space of modular forms of level 
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1 and weight k is one-dimensional for 4 < k < 10 and for k = 14, one gets the 
following relations: 


Eg(z) = Ea(z)”, 
E\0(z) = E4(z) £6(z), 


F\2(z) — Eg(z)E4(z) = — sere A(z) 
691 
and 
Pies hey Ss Ae 
691 


All of these identities can be expressed as relations between the functions 0; () for 
various values of k and t(n). 
Ghate [59] has classified all monomial relations between Eisenstein series. 


3 A Formula for the t-Function 


Building on the ideas of the previous section, we can also ask for relations between 
Eisenstein series and cusp forms. There is, of course, the relation 


1728A(z) = E4(z)? — E6(z)?. 


We have the Fourier expansion 


4k 
Ex(z) =1- Be Yo ox-1n)q”. 


n>1 


In particular, 


E4(z)=1+ 240° o3(n)q" 


n>1 


and 


Eo(z) = 1 — 504} °o5(n)q”. 
n>1 

This leads to a nonlinear relation for t(7) in terms of generalized divisor functions. 

Starting from Lahiri [104], several authors have investigated the possibility of 
quadratic relations between Eisenstein series and cusp forms. These relations arise 
by studying the effect of certain differential operators on the space of modular forms. 
In particular, we will discuss three formulas for the t-function in terms of divisor 
functions. 

The first of these is due to Niebur [149]. 


Theorem 3.1 Forn > 1, we have 
n—l 

t(n) =n*o(n) — 24) °(35k* — 52k?n + 18k7n)o (k)o(n — k). 
k=1 
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Proof Set 


f(@) = log A) = —2miz +24 D9 LOD). 


n=1 


Now define 
FQ) =18(fO WD) + fF OFO® — 16 Ff? OFOW. 


Then F(z) = F(z + 1). Moreover, the first Fourier coefficient is 2° - 3 - 7°. Hence, 
if we show that F(—1/z) = z!? F(z), it will follow that 


F(z) =2°? -3- 72° A(z). 
The definition of f implies that 
f(-1/2) = —logz’* + f (2). 
Differentiating this repeatedly, we get the identities 
f'(-1/z) = -12z + 2 f’(2), 

FO (H1/z) = -1227 +22 f/@O+ 7 FP}, 

FO (1/2) = — 242? + 624 f(z) + 62° f(z) +P FO], 

f (-1/z) = —7224 + 242° f’(z) + 362° f (z) + 1227 FO (z) + 2B f(z), 

f © (-1/z) = —2882° + 120z° f(z) + 24027 f(z) + 12028 f(z) 

+202? f(z) + 2 fO(Z). 


Using these equations, we find that 


3 
HQ) = fO@™+ FOFY@-F(F ()) 


is a cusp form of weight 4 for SL2(Z) and hence must be identically zero. On the 
other hand, we have 


F(-1/z) =z? F(z) — 122!! A’ (z) — 4829 A(z) 
and so 


F(-1/z) =z" F(z) 


as required. 


4 Derivatives of Modular Forms 


Let f be a modular form of weight k for a group I’. In particular, we have 


az+b\ | k 
(SH) -+a f() 
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a b 
ic ayer (39) 


az+b 
czt+d 


for elements 


Differentiating this gives 
(ad — be)(cz + d)~* r'( ) =ke(cz +d)! f@) + (cz +4)" f'@) 


and for ad — bc = 1, this is 


(3) = (cz + dy" (kef (2) + (cz +d) f'(2). 


Thus we see that differentiation in general destroys the modular property. 
Define the differential operator 


— 1 sk p= 1 k i d 

«Oni diy)  2ni\2iy * dz 
where z = x + iy. This operator, when applied to a holomorphic modular form, 
preserves the modular transformation property but not the holomorphy. Maass [118] 
proved that if f is a holomorphic modular form of weight k, then 6, f is a real- 
analytic modular form of weight k + 2. More generally, he considered iterates of 
this operator, namely 


of UN RE AS. AN IRE HE kd 
j= ; ; + : + BEE (Merrett et 
2mi 2iy dz 2iy dz 2iy dz 


and showed that it takes holomorphic modular forms of weight k to real-analytic 
modular forms of weight k + 2r. 

A large number of nonlinear identities can be produced by computing the effect 
of the Maass operator on Eisenstein series. 

In 1956, Rankin [170] proposed a differential operator on pairs of modular forms 
that preserves holomorphy. Let f be a modular form of weight k, and g a modu- 
lar form of weight 2, both for the same subgroup I” of SL2(Z). Now consider the 


operator 
(ktn—-l\(€+n-1\_, 0, 
(heh = 1 ( - )( : )p fDP". 


r 
r=0 


In particular, 


[f, gli =kfDg — U(Df)g. 
If both f and g are modular for I” (of weight k and ¢ respectively), then for 


a b 
(: nyer (40) 
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we have 
+b 1 ; ; 
Lfgh (= 7 *) = sez tak @s'@-le@s'@) 4D 
and this is 
b 
hen (St *) = (cz +a)L f, gh (2). 


In general, it is a result of Cohen (building on work of Rankin) that [f, g], is a 
modular form of weight k + €+ 2n for the same subgroup J”. Moreover, if n > 1, it 
is a cusp form. 


5 Differential Operators and Nonlinear Identities 


Many examples of Maass derivatives and Rankin—Cohen brackets have been com- 
puted. Such calculations usually result in a nonlinear identity involving Fourier co- 
efficients of Eisenstein series and cusp forms. 

Formally, for a series 


lee) 
g@)= plata" 
n=0 
we have 
lee) 
5” 9(z) = Yoon 
n=0 
where 


“(r\ Pk+r) eer 
b =) ee (4 Intl, 
- x) Pk+r—): me 


Using this and some general results of Shimura [189, 190], it is shown by Lanphier 
[109] that 
2 320 
6464+ 64E4 = =3) Eg — —A. 
9 3 
Taking the projection of both sides to their holomorphic parts, the identity 


” 2 320 
D7 max(m)(n — m)ox(n — m) = sn? ax(n) — Sr (n) 


m=0 
is obtained. Rearranging gives the identity 


n—-1 


T(n) = n?o7(n) — 540 m(n — m)o3(m)o3(n — m). 


m=1 
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On the other hand, consider the Rankin—Cohen bracket [E4, £6], which is a cusp 
form of weight 12 for the full modular group. Thus, it is a multiple of A. Unpacking 
this gives the relation 


4E4DE6 — 6E6DE4 = —3456A. (42) 
Also, we have the relation 
E\9 = E4 Es. 
This comes from the fact that the space of modular forms of weight k € [4, 10] is 
one-dimensional. Differentiating the above relation gives 
DE\9 = EgDE4 + EqDEg. 
Inserting this into (42) gives the identity 


inthe e230 S G-wRwae=w) 
T(n) = —-— — - —m). 
n 5429 n Tie n n—m)o3(m)os5(n —m 

m= 
Ramakrishnan and Sahu [167] show how one can obtain all of the results of [109] 
using Rankin—Cohen brackets in place of the Maass operators and holomorphic pro- 
jection. 


6 Quasi-modular Forms 
The Eisenstein series F2(z) (which is P in Ramanujan’s notation) is not quite a 
modular form. It satisfies the transformation property 
12c(cz +d) 
2ri 


b 
e(S = ) = (cz +d)? Ep(z) + 


czt+d 
a b 
(: D) € SL2(Z). 


More generally, Kaneko and Zagier [90] introduced the notion of a quasi-modular 
form to capture this behaviour. 

Let k and s denote two non-negative integers. A holomorphic function f on the 
extended upper half-plane (that is, the upper half-plane together with the cusps) 
is called a quasi-modular form of weight k and depth s if there are holomorphic 
functions Qo(f),..., Qs(f) with the property that for any 


a b 
(: ayer 


az+b\ | ~ c 
p( SEE) =tc+a Y ano(—5) 


i=0 


for any 


we have 


Thus, £2 is a quasi-modular form for SL2(Z) of weight 2 and depth 1. 
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By applying the identity element in the above, we see that a quasi-modular form 
f satisfies 


f = Q0(f). 


Moreover, quasi-modular forms have Fourier expansions as they are periodic. This 
can be seen by applying the element 


1 1 
') € SL2(Z) 


to a quasi-modular form f giving the relation 
FR+1) = Qo(f/)@) = Ff). 


Denote by Me the space of quasi-modular forms of weight k and depth s for 
the full modular group. The operator D preserves quasi-modular forms, and in fact 


~ <s ne 
D:M,. — Mes". 


Moreover, it is a fact that s <k/2. At the moment, the only examples of quasi- 
modular forms that we have are E> and derivatives of modular forms. It is a fact 
that all quasi-modular forms for the full modular group can be obtained in this way. 
In particular, 
k/2-1 
MEX? = GD Di Myvi @ CD*" By, 
i=0 
The functions Q;(f) in the definition of the quasi-modularity property are them- 
selves quasi-modular forms by a result of Martin and Royer [121], Lemma 19. In 


fact, Qo(f) = f and Qi(f) € M3; . 
The definition of the Rankin—Cohen bracket can be extended to the space of 
quasi-modular forms. In particular, Ramanujan’s differential equation 


DE) = —< (Es — E3) 
is a consequence of 
[E2, A]i = AE4. 
The relation 
[E2, E2]4 = —48A 


gives the Niebur identity 


n—l 
t(n) =n4o(n) — 245 °(35a4 — 52a°n + 18a°n”)o(a)o(n — a). 


a=1 
Moreover, the relation 


[E4, DE4]; = 960A 
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gives the van der Pol identity 


n—-1 
T(n) = n°03 (n) + 605° a(9a — 5n)o3(a)o3(n — a). 


a=1 


7 Non-linear Congruences and Their Interpretation 


Earlier, we saw that congruences satisfied by the t-function that related it to various 
of the ox functions could be interpreted in terms of Galois representations. More 
specifically, the reducibility modulo a prime £ of the ¢-adic representation attached 
to A resulted in a congruence for t(p) modulo ¢ for (almost all) primes p. 

In the case of non-linear relations, it is also possible to derive congruences by 
reducing modulo a prime that occurs in the support of the coefficients. For example, 
the van der Pol identity gives the congruence 


t(n) =n*03(n) (mod £) 
for € = 2, 3,5 (the primes dividing 60). In particular, 
t(p) = p°o3(p) (mod £) 


and it is this latter congruence that we seek to “explain”. 

If we consider the ring of (mod ¢) Galois representations associated to mod- 
ular forms, it has an algebra structure under ® product. If this algebra is finitely 
presented, then the set of relations will generate all possible quadratic (and higher- 
order) relations between the representations and hence also their characters. 


Chapter 9 
Mock Theta Functions and Mock Modular 
Forms 


1 Historical Introduction 


In his last letter to Hardy, written three months before his death in 1920, Ramanujan 
describes the beginnings of a new theory of what he called “mock theta functions.” 
He gave no precise definition of these objects. Rather, he listed seventeen examples 
and a qualitative description of their key properties, namely, that these functions 
have asymptotic expansions at every rational point similar to classical theta func- 
tions, but that there was no single theta function whose asymptotic expansion agrees 
at all rational points with that of the mock theta function. 

Since Ramanujan’s letter, many mathematicians like G.N. Watson (who had in 
his possession the famed “lost” notebook), A. Selberg, and G.E. Andrews studied 
the examples Ramanujan had given and gave rigorous proofs of some of the state- 
ments he had made. New excitement emerged in the 1970s when Andrews [7] found 
Ramanujan’s lost notebook among the papers of Watson and attempted to prove 
some of these conjectures. 

Indeed, in 1988, Hickerson (see [78, 79]) proved many of these conjectures. 
However, no unifying theory emerged from these papers. It was only in 2002, when 
S. Zwegers completed his doctoral thesis under D. Zagier, that we find in his the- 
sis an intrinsic characterization of mock theta functions. Later, Bringmann and Ono 
used this theory to resolve several open problems in combinatorics and the theory 
of q-series. 

Before we proceed, it may be useful to direct the reader to the lucid Séminaire 
Bourbaki survey of Zagier [206] in which he writes “Ramanujan used the word 
‘theta function’ where we would say ‘modular form’ today, so that ‘mock theta 
functions’ meant something like ‘mock modular forms’.” More precisely, a mock 
theta function is a mock modular form of weight 1/2. 

All of the seventeen examples of Ramanujan are of the form 


(oe) 
Y> An(q), 
n=0 
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where A,,(q) is in Q(q), and for all n > 1, An4i(g)/An(q) = R(q, q") for some 
fixed rational function R(q,r) € Q(q,r). Such series are called g-hypergeometric 
series. 


2 Ramanujan’s Examples 


What exactly was contained in Ramanujan’s letter? What were his seventeen exam- 
ples? Ramanujan divides his examples into three groups, four of “order 3”, ten of 
“order 5” and three of “order” 7, though he gave no definition of what he meant by 
the word “order”. Here are some examples: 


[o,@) 2 
— q 
1O= aap da 
= (-@)" 
6@=) Gata) Ue) 
25 (-@)" 
vg=>- 


d+p0+@)-- 0+ a) 


n=1 
There is some structural similarity of these functions to those appearing in the 
Rogers—Ramanujan identities. However, these functions are more complex. Ra- 
manujan states without proof that 
1 —2g +2g4 —2q?+-:-- 
Corgi bg Cat gee 
where the expression on the right-hand side is easily seen to be a modular form of 
weight 1/2 (up to a factor of g~!/2*). From this example we see there are linear 
relations among the mock theta functions like ¢ = f + 2w. The space spanned by 
them contains a subspace of ordinary modular forms, and after multiplying by a 
suitable power of g, we get modular behaviour. The other mock theta functions in 
Ramanujan’s list have similar properties. 

With all of these examples the following features emerge. If H(q) € Z[[q]] is 
one of Ramanujan’s mock theta functions, then H(q) acquires some modular trans- 
formation property after the following steps: (i) multiply H(q) by a suitable power 
of q; (ii) change q to e?”'* and set h(t) = e274* H(e?"'*); (iii) add a simple (but 
non-holomorphic) correction term so that the corrected function has modular prop- 
erties. 


2o(M -fM=fMt4VQe= 


3 The Work of Zwegers 


In 2001, Zwegers (see [206]), in his doctoral thesis, discovered the relation between 
non-holomorphic modular forms, Lerch sums and indefinite theta series. Zwegers 
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discovered that mock theta functions are really the “holomorphic parts” of real ana- 
lytic modular forms of weight 1/2. This allowed one to define higher weight “mock 
theta functions”, now called mock modular forms. Thus, a mock modular form is 
the holomorphic part of a weak Maass form, and there is a simple algebro-analytic 
description of these objects. 

To motivate our discussion, we begin with the Fourier expansion of the Eisenstein 
series of weight k, 


4k fr 
Ex@)=1— Doni)", 


n>1 


and the observation that the infinite sum can be rewritten as 


k-1 d _ k-1 qf 
DD a Dae Oar 


re 
d>1 m>1 d>1 q 
A more complicated sum that bears some superficial resemblance to this is 
n(3n+1)/2 


n@ 
—1)"——_.. 


neZ 


This function occurs in Ramanujan’s “lost notebook”. 
Following Zwegers (and the exposition by Zagier), consider the function of two 
complex variables 


(Sbytgn er? 


> 


1 
wu, v) = wu», = 


oy neZ l= Gn 


where g = e77'7, a =e?" and b = e77!”, and 6(v) is the function 
O(v) = Vbq'/® See, 
meZ 


which by the Jacobi identity is 


[o,@) 
9(v) = Vbq"8 | [dd —qn)(1 — bg")(1— 5! q""!). 
n=1 
Zwegers defines a function R(z; t) with z,t € C and t having positive imaginary 
part, such that the function 
wu, v, tT) — Ru —v,T) 
is a Jacobi form. Now using the principle that specializing the elliptic variable in a 
Jacobi form to a “torsion point” results in a usual modular form (up to a multiple of 
a power of q), one gets a modular description of mock theta functions. 
In particular, the function denoted f(g) in the previous section (which Ramanu- 
jan called a mock theta function of order 3) satisfies the identity 


00 n(3n+1)/2 


n oe = n@d 
[[G-a")f@ => OE) <a 


n=1 neZ 
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and Zwegers shows that if we set 


R(t)= > sgn(n)B(n2y/6)q-” 4 


n=1 mod 6 


pays [wie 


q!** fq) + R(t) 


transforms like a modular form of weight 1/2 for the principal congruence subgroup 
I (2) of level 2. 


where 


then 


4 The Space of Mock Modular Forms 


With Zweger’s theory in hand, Zagier defines the space M,; of mock modular forms 
in the following way. It will contain the space M Hy of weakly holomorphic modular 
forms of weight k. These are holomorphic functions on the upper half-plane that 
transform like modular forms of weight k (under the action of a group J”) and which 
may have a pole (of finite order) at each cusp. Then Mj, is an extension that fits into 
an exact sequence 


0 > Mi. > Mz — Mo. — 0 


where the first map is inclusion, and the second map to the space M2_, of modular 
forms of weight 2 — k (the so-called ‘shadow map’) is defined as follows. Given a 
modular form g € M>_,, define the transform 


se = a! [w+ 2) dw. 
=F 

This is essentially an Eichler integral. Then the ‘shadow’ of h € Mz, is the unique 
element g € Mo_; for which h + g* transforms like a modular form of weight k for 
a certain subgroup of SL7(R). (The shadow mapping is reminiscent of Serre duality. 
See also the last section in Chap. 2.) 

Neither the existence nor the uniqueness of g is evident from this description. 
However, there is an alternate description that will make this clear. Denote by M, 
the space of real analytic functions f that transform under I” by 


az+b = k 
(SH) -omec+a f(z) 


with a character p of I’. Define Mi to be the elements f € My for which 
Xz) OF /0z is anti-holomorphic, that is 


s =0. 
az =(¢ Be =) 
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The elements of Mi are called harmonic weak Maass forms. They are non- 
holomorphic modular forms which are eigenfunctions of the Laplacian with eigen- 
value k(2 — k)/4 and having at most exponential growth at the cusps. 


5 Some Applications 


With the formulation of the concept of a mock modular form, many arithmetic appli- 
cations have been developed. Some of these have to do with congruences satisfied by 
the partition function. Others have to do with central critical values of L-functions. 
We briefly describe the latter as formulated by Bruinier and Ono [26]. 

The work of Waldspurger established a relation between the Fourier coefficients 
of half-integral weight modular forms and central critical values of quadratic twists 
of the L-functions associated to integral weight modular forms, with the two forms 
in question being connected by the Shimura correspondence. In the case of Maass 
forms, work of Katok and Sarnak related Fourier coefficients of Maass forms of 
weight 1/2 to line integrals of certain Maass cusp forms. Bruinier and Ono consider 
the analogues of these results for harmonic weak Maass forms that are not weakly 
holomorphic modular forms (in other words, whose shadow is nonzero). Such forms 
have a Fourier expansion of the form 


fe(2= > ct(n)q" + Le (n)W (27nv)q" 


n>—0o n<O0 


where 
W(x) = W(x) = (1 —k, 2Ix|) 


is the incomplete Gamma function. The first sum is referred to as the holomorphic 
part of f, and the second sum as the non-holomorphic part. 

Now consider a normalized eigenform G(z) of weight 2 for [9(p) with the prop- 
erty that the sign of the functional equation in its L-function is —1. By Kohnen’s 
theory (which is an explicit form of the Shimura correspondence), there is a new- 
form g of weight 3/2 for [o(4p) in Kohnen’s plus space which lifts to G under the 
Shimura correspondence. Then, Bruinier and Ono prove that there is a weight 1/2 
harmonic weak Maass form f, (say) on I(4p) with the following property. If we 
write the Fourier expansion as 


fg= > cy (n)g” + Yo cy @)W2xnv)q", 


n>—0o n<0 


then for negative fundamental discriminants D for which (3) = 1, we have the 
Waldspurger-type formula 


|D 


LG, xp, 1) = 827 GI" llall See DY, 
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where || - || denotes the Petersson norm. Moreover, they show that for positive 
fundamental discriminants D for which (*) = 1, we have that the derivative 
L(G, xp, 1) = 0 if and only if cE (D) is algebraic. 

Finally, they make interesting conjectures about the number of transcendental 
Fourier coefficients and the number of non-zero Fourier coefficients. 

All of this opens a new world of mathematics. The reader will profit much by 
reading the excellent survey by Ono [152] to get a glimpse of more wonders. It is 
thus fitting to close this informal discussion with some prophetic words of Freeman 
Dyson [45]: “The mock theta functions give us tantalizing hints of a grand synthesis 
still to be discovered. Somehow it should be possible to build them into a coher- 
ent group-theoretical structure, analogous to the structure of modular forms which 
Hecke built around the old theta functions of Jacobi. This remains a challenge for 
the future.” With these recent advances, Dyson’s dream may soon be realized. 


Chapter 10 
Prime Numbers and Highly Composite Numbers 


1 The Divisor Functions 


The structural properties of natural numbers and their detailed study forms a signifi- 
cant chapter in analytic number theory. In his memoir on highly composite numbers, 
Ramanujan initiated an important method to study general arithmetic functions. This 
method has become a dominant theme in current research. Surprisingly, these stud- 
ies take us into allied areas of transcendental number theory and an intimate study 
of the Riemann zeta function. 

Ramanujan’s memoir of 1915 dealing with highly composite numbers begins 
with the divisor function d(n) which is the number of divisors of n. It is easy to see 
that if 

a a ak 
n= pi! p52 +> py 
is the unique factorization of n as a product of distinct prime powers, then 
d(n) = (a, + I(aat+1)--- (a +1). 


There are many ways one can try to understand the nature of such a function. For 
example, one can try to understand the average order by studying 


Sdn). 
n<x 
This was done by Dirichlet who proved that 
Yo dn) = xlogx + Qy — x + O(x"”), 
n<x 


where y is Euler’s constant. Thus, on average, d(n) behaves like logn. The cele- 
brated Dirichlet divisor problem is to find the infimum of all 6 such that 


Y\d(n) = xlogx + Qy - 1)x + O(x*). (43) 


It is conjectured that this infimum is 1/4. After Dirichlet, Voronoi proved in 1904 
that 6 < 1/3. In 1916, Hardy showed that 6 > 1/4. After a succession of powerful 
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and ingenious methods in the theory of exponential sums, the current record is 0 < 
131/416 due to Martin Huxley in 2003. 
One can also study “moments” of the divisor function 


V~F@ 


and try to obtain asymptotics for this. This approach was initiated by Ramanujan. 
For example, he proved that 


yd? (n) = Ax log? x + Bx log? x + Cxlogx + Dx + O(x3/5+¢), 


n<x 


for any € > 0. He noted that the error term can be improved to O(x!/2+*) on the 
Riemann hypothesis. This should be compared with a result of Ingham [84], who 
proved in 1927 that 


Yo d(n)d(n +.) ~ Sox log? x 


n<x 


as x tends to infinity. In other words, d(n) and d(n + A) (A £0) have little “corre- 
lation” since this is significantly of lower order by a factor of log x if A 4 0. More 
generally, he obtains on the Riemann hypothesis that 


Y\d* (n) = x Py(logx) + O(x'/?**), 
n<x 


where P;(t) is a polynomial of degree 2* — 1. In this context, Ramanujan studies 
the Dirichlet series 
d?(n) 


ns 


p18 


n=1 

and more generally 

[o,e) 

- Jq(n)op(n) 

ns 

f=) 
where o4(n) = >> d\n @“ are the generalized divisor functions. This is a precursor to 
the general construction of a Rankin—Selberg convolution in the theory of automor- 


phic L-functions. Indeed, Ramanujan proves that 


CO 


3 Fa(nyon(n) _ S(s)b(s — a(S — b)G(s — a — b) (44) 
— ns ¢(2s—a—b) 
and, moreover, that 
y Oa(njon(n)x(n) _ L(s, x)L(s —a, x)L(s — b, x) L(s —a —b, x) 
= ns . L(QQs —a—b, x?) 


where x is the non-trivial Dirichlet character (mod 4). Both of these equations are 
based on a simple power series identity noted by Ramanujan, namely 


1 The Divisor Functions 137 


[o,@) 
S424 242 42")(Lt wt w? +--+ w")q" 
n=0 


- 1 —zwq? 
~ (L=q)U = zq)(1 — wq)( — zwq) 


Estermann [51] considered the more general Dirichlet series 


ye 


n=1 


where d,(n) is the number of ways of writing n as a product of k natural numbers. 
In this context, several cognate sums appear, and they are of two types. The first is 
of “Rankin—Selberg” type, and the second seems to be of “modular type” at least in 
a majority of cases. An example of the first type is the sum 


PS d(n)d(n +A), 


n<x 


mentioned above, and more generally 


Yo oa(n)ow(n +A), 


n<x 


both of which were studied by Ingham in [84], who derived asymptotic formulas 
for them. Indeed, their asymptotic behaviour had been conjectured by Ramanujan 
in his 1915 paper. The second type involves 


Yo oa(jov(n — J). 


j<n 


These convolutions can be evaluated explicitly in terms of Fourier coefficients of 
modular forms when a, b are odd and > 3. In fact, Ramanujan showed that the 
above sum is asymptotic to 


Pat+)rbo+Dea+beb+)) 
Via+b+2)e(at+b-+ 2) 


and conjectured that the same must hold for other values of a, b. According to Bruce 
Berndt’s commentary, in 1979, Grosjean [60] disproved this conjecture, but the pa- 
per seems lodged in a journal that is not so easily accessible. However, in 1957, Hal- 
berstam [66] seems to have also disproved the conjecture using the circle method 
and estimates for Kloosterman sums. (There is also a later work by Motohashi [127] 
related to this problem.) Halberstam proves that there are additional terms to the 
asymptotic formula and when a and b are both odd, these terms disappear. 

If at least one of a, b is even, there seem to be no explicit formulas of the type 
Ramanujan derived. One only has asymptotic formulas. A theoretical explanation 
of why such formulas are missing would be welcome since there is no apparent 
modular connection and would perhaps open a new area of research. 


Oatb+1(n) 
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2 Ramanujan and the Prime Number Theorem 


Ingham [85] noted that Ramanujan’s identity (44) implies the non-vanishing of ¢ (5) 
on t(s) = 1. It is well known that this is equivalent to the prime number theorem, 
which asserts that the number of primes p < x, denoted r(x) satisfies the asymp- 
totic law 


m(x) ~ 
logx 

as x —> oo. This was proved by Hadamard and de la Vallée Poussin (independently) 
in 1896. Using an analogue of (44) for L-series, Ingham [85] noticed we also get 
non-vanishing of Dirichlet L-series on t(s) = 1, a fact which is equivalent to the 
prime number theorem for arithmetic progressions. This says that if w(x, q,a) is 
the number of primes p < x which are congruent to a (mod q), with (a, q) = 1, we 
have the asymptotic 


18 (x J qd * a) ae ’ 
p(q) logx 
where ¢ denotes Euler’s function. 
As noted by Riemann in his celebrated paper of 1860, the function 


; ; I-e at x dt 
lix := lim — + —— 
«>0\ Jo logt i+e logt 


is a better approximation to (x) than x / log x in the sense that the effective version 
of the prime number theorem of Hadamard and de la Vallée Poussin can be stated 
as 


(x) =lix + O(xe°V'8*) 


for some constant c > 0. Elaborate techniques in the theory of exponential sums 
have only led to modest improvements of this result. By a 1958 result of Korobov 
and Richert, we know that 
2/3 
n(x) =lix + O(x~1008*)"") 
for some positive constant c; > 0. There has not been any further improvement. Any 
result of the form 


m(x) =lix + O(x°) 


with @ < 1 is equivalent to the assertion that ¢(s) £0 for {i(s) > 6. Thus, if we 
are allowed to take any 6 > 1/2, this is equivalent to the Riemann hypothesis. An 
excellent exposition of the relationship between exponential sums, zero-free regions 
of ¢(s) and error terms in the prime number theorem can be found in [198]. In [16], 
Balasubramanian and Ramachandra show that a suitable extension of (44) imme- 
diately leads to the prime number theorem for arithmetic progressions with error 
terms. 

In his first letter to G.H. Hardy sent on 16 January 1913, Ramanujan wrote, 
“T have found a function which exactly represents the number of primes less than x, 
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‘exactly’ in the sense that the difference between the function and the actual number 
of primes is generally 0 or some finite value even when x becomes infinite. I have 
got the function in the form of infinite series and have expressed it in two ways: 


(1) in terms of Bernoulli numbers ... 
(2) as a definite integral ...” 


In his second letter, Ramanujan gave a third expression for (x). 
These assertions can be given the following form. Using the notation of Hardy 


[67], let 
ey as (log x)! dt 
1) =f it¢@+DFe+))’ 


[ee 


4 k logx \7*"} 
ey vam Qn ) ; 


k=1 


ioe) -A/k 

Lk) [ dt 
R(x) = —— —— 
@) a k Je logt 


k=1 


“where c = 1.45136380... nearly” to quote from Ramanujan’s second letter of 27 
February 1913. 

The last series, R(x), appears for the first time in Riemann’s 1860 paper, which 
Ramanujan could not have read. The series G(x) is related to one discovered by 
Gram. But the series J(x) was due to Ramanujan. Hardy [67] is of the opinion that 
Ramanujan discovered all these series for (x) independently. 

Ramanujan’s assertions imply that if F(x) is any of these three functions, 


u(x) — F(x) = O(1). (45) 


In his essay on Ramanujan’s work on prime numbers, Hardy [67] proceeds to show 
how (45) is false. Indeed, one can show that 


J (x) = R(x) + o(1) 
and 
G(x) = R(x) + 0(), 
so that one need only study the validity of 
w(x) = R(x) + OCI). 
From this would follow 
n(x) =lix + O(x!/*t) 


for any € > 0, an assertion already noted to be equivalent to the Riemann hypothesis. 
However, the result also implies, as indicated by Hardy [67], 


1 
n(x) =lix — 5 lixt? + O(x!3+¢) 
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for any € > 0. In particular, this implies 
(x) —lix > —oo (46) 
as x — oo. But in 1914, Littlewood showed that 


'/? log log] 
n(x) —lix= 2, (~ Eee a “2*). 
logx 


This means that there are positive constants c;, cz such that 


. x!/? log log log x 
w(x) —lix > a os 
ogx 


for infinitely many x tending to infinity and 


. x!/? log log log x 
m(x) —lix <—c2 


logx 
for infinitely many x tending to infinity. Thus, (46) is false. 

In his letters, Ramanujan also made interesting assertions about what we now 
call “prime number races.” In his first letter, he wrote “The difference between the 
number of prime numbers of the form 4 — | and which are less than x, and those of 
the form 4n + 1 less than x, is infinite when x becomes infinite.” In other words, if 
we denote by z4(x) the number of primes p < x with p =a (mod 4), witha = 1,3, 
then Ramanujan’s assertion is that 


3(x) —11(x) > co (47) 


as x tends to infinity. Now from the prime number theorem for arithmetic progres- 
sions we have 


1 
(xX) ~ 13(xX) ~ 5 lix 


as x tends to infinity. It is surprising that Hardy does not discuss this in his book 
since (47) was shown to be false by Littlewood [112] in 1914. 

The theme of “prime number races” is a topic of current research. In the 1960s, 
Knapowski and Turan [99] wrote a series of papers developing a new branch of 
number theory called comparative prime number theory to tackle such questions. 

However, the study of 73(x) — 71(x) goes back to Tchebycheff [195] in 1853, 
who made two assertions. Let 


A(x) = 13(x) — m1 (X). 
(1) Then, there exists an infinite sequence of real numbers {x,} tending to infinity 
such that 
. Ary) log x; 

lim —————-_ = 1; 

r>oo /X;- 
(2) define the function 

Fors Foe re? 


Pp 
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where the sum is over prime numbers. Then, as o > O07, the function f(o) 
tends to —oo. 


The reader will note that in the summation in (2), primes congruent to | (mod 4) 
are weighted with a + sign and primes congruent to 3 (mod 4) are weighted with 
a — sign. Ramanujan’s assertion can be seen as a variation on assertion (2) of 
Tchebycheff. Naively, the assertion says that there are “more” primes congruent 
to 3 (mod 4) than there are primes congruent to 1 (mod 4). As mentioned earlier, 
Littlewood showed in 1918 that 


x!/? log log log x 
n(x) — n(x) = 4 (~—PR SEES a ) 
log x 


so that Tchebycheff’s assertion (1) is true. In fact, the limit | can be replaced by 
any real number, and the assertion is true. Regarding (2), Hardy and Littlewood [68] 
show that if x is the non-trivial character (mod 4) and 

[o,@) 


L6.0=>> x) 


n=1 


satisfies the analog of the Riemann hypothesis, then there exists an H > 0 such that 


SY ene vee eS 
; Vo log(1/o) 

for all sufficiently small values of o. Thus, (2) follows on the Riemann hypothesis 
for L(s, x). 

On the other hand, Landau [105] showed that if L(s, x) does not satisfy the 
Riemann hypothesis, then 

—coo=liminf f(o) and limsup f(o) =+o0. 
o>0t o> 0t 

Thus, the function in (2) has no limit as o — O* if the Riemann hypothesis for 
L(s, x) is false. Put in another way, (2) implies the Riemann hypothesis for L(s, x). 

The more general problem of studying 

u(x, q,a) a U(x, q; a2) 

has been explored in detail in a series of papers by Knapowski and Turan [99]. 

In his letter of February 1913, Ramanujan wrote that 

M(X,q, 41) — W(X, q, a2) > 00 


for the triples (q¢; a1, a2) = (4; 3, 1), (6; 5, 1), (8; 3, 1) and (12; 5, 1). Of course, all 
of these assertions are false, but they hint at the fact that Ramanujan gave some 
thought to such oscillation questions in India, before his journey to England in 1914. 


3 Highly Composite Numbers 


What is perhaps surprising about the “Twelve Lectures” is the arrangement of top- 
ics of Hardy’s lectures, as well as the omissions. The first four of the lectures are 


142 10 Prime Numbers and Highly Composite Numbers 


devoted to classical analytic number theory. But Ramanujan’s paper on highly com- 
posite numbers finds no discussion. 

A number n is said to by highly composite if d(m) < d(n) whenever m <n. His 
paper discussing highly composite numbers is over sixty pages and signals a general 
method by which to study other arithmetical functions. 

To understand Ramanujan’s work in this context, we review some history of re- 
sults concerning the divisor function. It is elementary to show that 


d(n) = O(n‘) 


for any € > O (see, for example, [131]). Using the prime number theorem, 
Wigert [203] proved in 1907 that for any € > 0, 


d(n) < 9+) logn/loglogn (48) 
for all n sufficiently large and that 
d(n) > 7(1-e) logn/loglogn a 


for infinitely many n. This result determines the maximal order of the divisor func- 
tion. Ramanujan proved (48) and (49) independently without using the prime num- 
ber theorem. In fact, he proved the stronger assertion that the maximal order of 
logd(n) is 

(log 2) idogn) + O ((log n) exp(—c,/ log n)) 


for some positive constant c > 0. The essential idea is as follows. 
Let 


O(x) = > log p, 
psx 


where the sum is over primes p < x. Suppose that N has k distinct prime factors. 
Then, it is easy to show that 


[((pe) + log N)/k]k 
(log p1) +++ (log px) 
where p; denotes the jth prime number. Now partial summation shows that 


m(x)logx — (x) = f° *OM (50) 
2 


t 


d(N) < 


If we have the prime number theorem, then it is easy to deduce 


O(pk) ~ Pk ~ klogk. 


But one can deduce 6(px) ~ klogk from (50), and this is essentially Ramanujan’s 
idea (see also Robin [174]). 

As noted by Nicolas [146], Ramanujan’s method also determines the maximal 
order of r(n), the number of ways of writing n as a sum of two squares. One finds 
that r(7) has the same maximal order as the divisor function. Nicolas [147] points 
out that this result is given incorrectly on p. 270 of the classic textbook by Hardy 
and Wright [72]. 
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In his paper, Ramanujan introduces the notion of a “superior highly composite 
number.” N is said to be a superior highly composite number if for some € > 0, we 
have 

d(m) es d(N) 


me ~— NE 


for all m < N. Since 
d(m) < (m/N)*d(N) < d(N) 


for all m < N, we see that a superior highly composite number is also highly com- 
posite. 

Ramanujan gave a complete classification of highly composite numbers and su- 
perior highly composite numbers. Surprisingly, there is an interesting connection to 
transcendental number theory that we explain in the next section. 


4 Relation to the Six Exponential Conjecture 


Define 
peat log(1 + 1/k) 
~ log p 


If e« ¢ €, then the maximum of d(n)/n‘* is attained at only one integer N., and we 
have 


; p prime, k= 1,k = 2]. 


Ne=[]p%. ap =[1/(0°-1)] 


By the six exponentials theorem explained below and proved by Siegel, we know 
that for real A and three distinct primes p, q,7r, the numbers 
Moe oR 
P5q 5? 
cannot all be rational unless 4 is an integer. This implies that three elements of € 
cannot be equal. 

The six exponentials theorem is easy to explain. Suppose that x1, x2 are linearly 
independent over Q and yj, yz, y3 are linearly independent over Q. Then, at least 
one of the six exponentials e*/, | <i <2 and 1 < j <3, is transcendental. It is 
conjectured that one can take y,, y2 linearly independent over Q and at least one of 


the four exponentials e*’ with 1 <i, j <2 is transcendental. 
Now suppose that three elements of € are equal. Then we have 


so logd+1/kp)  logd+1/kg) _ log + 1/k;) 


log p loggq logr 
for three distinct primes p, g, r and positive integers kp, kj, ky. Then, 
1 1 1 
d h h 
=l1+—, =14+—, =14+— 
2) + kp qd oe ke r oF a 
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are all rational. If A is irrational, this contradicts the six exponentials theorem since 
we may take x; = 1, x2 =A and y; = log p, y2 = logg, y3 = logr in the theorem. 
Therefore, A is rational. But p* = 1+ 1/k p being rational implies that A must be a 
positive integer. Now, 
= logd + 1/Kp) _ tea 

log p ~ log p 
and hence p* < 2, which means p = 2 and A = 1. The same applies for g and r, a 
contradiction, since p, q,r are distinct. 

The connection to the six exponentials theorem was made by Alaoglu and Erdés 
[6]. The four exponential conjecture would imply that all elements of € are distinct. 
If true, this would mean that d(n)/n* attains its maximum at two integers. If false, 
the maximum would be attained at four integers for some €. As Nicolas [147] points 
out, this was overlooked by Ramanujan. 


Xr 


5 Counting Highly Composite Numbers 


Ramanujan devoted a section of his paper to the study of Q(x), the number of highly 
composite numbers < x. Since d(2n) > d(n), we see that between x and 2x, there 
is always a highly composite number. Indeed, taking the largest highly composite 
number v in the interval [1, x], we have d(v) > d(n) for alln <x, n Av. Now 
2v € [x, 2x] and d(2v) > d(v), so that Q(x) > logx. In his paper, Ramanujan [160] 
showed that 


lim Qt) = +00 
x>00 logx 


Clearly, the determination of Q(x) was of significant interest to Ramanujan. In their 
joint paper [70], Hardy and Ramanujan wrote “The problem of determining the 
number Q(x) of highly composite numbers not exceeding x appears to be one of 
extreme difficulty. It is still uncertain whether or not the order of Q(x) is greater 
than that of any power of log x.” 

Several mathematicians afterwards took up this problem. In 1944, P. Erdos 
showed that for some cy > 1, Q(x) > (logx)°!. The new ingredient was Hoheisel’s 
theorem [82]: 

8 


n(x +x°) —x(x)> 


(51) 
logx 
Hoheisel’s result has since been improved, but the connection to the distribution of 
primes in short intervals is important. One expects that (51) is true with any 6 > 0. 
The Riemann hypothesis would imply that any 5 > 1/2 is permissible. 

In 1971, Nicolas [145] proved that there is a constant c2 such that 


Q(x) < (log x). 


This resolves one of the questions raised by Hardy and Ramanujan. The order of 
Q(x) is not greater than any power of log x. 
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Here again, Nicolas [145] used a result from transcendental number theory to 
prove the theorem. More precisely, he used Feldman’s theorem: let 


» _ 1083/2) 
~  log2 ° 

There exists a « such that for all integers u, v > 1, we have 
|vd@ —u|>v “. 


The question of determining optimal values of c,, cz remains open. Nicolas re- 
marks that if we assume two conjectures, namely that (51) holds for any 5 > 0 and 
that for all 7 > 0, there exists a constant B(7) = B such that for all u, v, w € Z, we 
have 


|ulog5 + vlog3 + wlog2|> B((|u| + 1)(lvl + Ho, 
then writing Q(x) = (log x)°“, we have 
log 30 


lim sup c(x) = 
pate ”) log 16 


= 1.227... 


Unconditionally, we have 
1.136<cy <c2 < 1.71. 


All of these results are based on our knowledge of Diophantine approximations of 
(log(3/2)/log 2). 


6 Maximal Order of Divisor Functions and Other Arithmetic 
Functions 


In his paper on highly composite numbers, Ramanujan studied related arithmetical 
functions like the generalized divisor functions dy (n) counting the number of ways 
of writing 1 as a product of k numbers. He also studied the generalized divisor sum 
functions 0, (7), the sum of the sth powers of the positive divisors of n. However, 
these were not published. In fact, sections 53 to 75 of his paper on highly composite 
numbers were expunged in the final publication since the London Mathematical 
Society was having financial difficulties. Indeed, in the first edition of Ramanujan’s 
collected papers, the editors wrote, “The paper, as long as it is, is not complete. 
The London Mathematical Society was in some financial difficulty at the time and 
Ramanujan suppressed part of what he had written, in order to save expense” (see 
p. 238 of [145]). Fortunately, the suppressed part has been preserved and has since 
been published in the first volume of the Ramanujan journal [166]. The sections of 
the unpublished part of [160] were in a box of G.N. Watson’s papers kept at Trinity 
College Library and accidentally discovered by G.E. Andrews along with the now 
famous “Lost Notebook” of Ramanujan. 

One would wonder at this strange excision of a complete manuscript by the Lon- 
don Mathematical Society. Nicolas and Robin in [166] wrote, “Hardy did not much 
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like highly composite numbers. In the preface to the “Collected Works” he writes 
that “The long memoir [160] represents work, perhaps, in a backwater of mathemat- 
ics,” but a few lines later, he does recognize that “it shews very clearly Ramanujan’s 
extraordinary mastery over the algebra of inequalities.’ One of us can remember 
Freeman Dyson in Urbana (in 1987), saying that when he was a research student of 
Hardy, he wanted to do research on highly composite numbers, but Hardy dissuaded 
him as he thought the subject was not sufficiently interesting or important.” This ex- 
plains to some extent why the topic did not find a place in Hardy’s twelve lectures 
[67]. This is the second occasion in which Hardy used the phrase “backwater of 
mathematics” referring to some aspect of Ramanujan’s work. 

In his unpublished sections of [160], Ramanujan extended the notion of a highly 
composite number to other arithmetical functions such as r2,(n), the number of 
ways of writing n as a sum of 2k squares. He also studied the maximal order of 
rg(n), the number of ways of writing n as Q(x, y) with Q being a positive defi- 
nite binary quadratic form. As noted above, he considered maximal orders of d;(n) 
and o;(n). 

For example, the notion of a superior highly composite number extended to the 
study of o(n) is a number 7 such that 


a(m) _ a(n) 
< 
m n 
for all m <n. Such numbers are now called superabundant numbers. Colossally 
superabundant numbers are numbers n for which there is a positive € > 0 such that 
a(n) _ o(m) 
nite — mite 


for all m > 1. It turns out that the set of colossal superabundant numbers is infinite 
and is a subset of the superabundant numbers [6]. The study of these numbers was 
initiated by Ramanujan [166] in the suppressed portion of [160]. 

One particular result here requires special mention. The maximal order of 
o_\(n) =o(n)/n is eY loglogn, where y is Euler’s constant. From formula (382) 
of Ramanujan’s paper [166] it follows under the Riemann hypothesis that for no 
sufficiently large, 


a(n) < e’nloglogn. 


Robin [174] shows that if this inequality holds for all n > 5040, then the Riemann 
hypothesis follows. Lagarias [103] showed that the Riemann hypothesis is equiva- 
lent to 


o(n)< A, + elin log An, 


where Hi, = pee <n 1/j. This particular reformulation finds a place in logic, where 
it was used to show that the Riemann hypothesis is “Diophantine.” In fact, this 
equivalence of the Riemann hypothesis is intimately connected with the study of 
highly composite numbers. 
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As a consequence of Deligne’s work [39], we know that 
|z()| <n'*d(n), 


and more generally, if A ¢(m) is the nth Fourier coefficient of a normalized Hecke 
eigenform f of weight k, then 


|A¢(n)| <n®—P a(n). 


One would like to determine the maximal order of t(7) and generally A f(n). In 
other words, are these inequalities sharp? From our earlier discussion about the 
maximal order of d(n) we see that the above implies 


1 
Af) = o(ntoe exp( 08" )) 
n 


for some positive constant c. For the full modular group, the reader will recall that 
normalized Hecke eigenforms have real coefficients, and so, the question of whether 
these coefficients get very close to the given bound above is the question of its 
maximal order. R. Murty [128] showed that 


1 
Af(n) = 24(n"-?P exp — )) 
oglogn 


for some c > 0. For this purpose, he needed to use analytic properties of symmetric 
power L-functions attached to the automorphic representation associated to f. With 
recent progress on the Sato—Tate conjecture, it is likely that new ideas can be injected 
into the further study of such oscillation results. Indeed, the maximal order of t (7) 
can now be determined explicitly, and one can show that the optimal value of c in 
the above result is c = log 2. 


Chapter 11 
Probabilistic Number Theory 


1 The Normal Order Method 


In their fundamental paper of 1917, Hardy and Ramanujan [69] introduced the con- 
cept of “normal order” of an arithmetical function. To be precise, this means the 
following. Given an arithmetical function g(n) and an increasing function ¢(n), we 
say that g(n) has normal order $(n) if for any € > O, the number of n < x such that 


|g(n) — (n)| > eo) (52) 


is o(x) as x — oo. In other words, for almost all natural numbers n (in the sense of 
natural density), we have 


(1 —€)o(n) < gin) < + €)o). 


Some arithmetical functions like the divisor function, for example, may not possess 
a normal order. But other functions like w(n), defined as the number of distinct 
prime factors of n, do have a normal order. In fact, the main result of [69] is that the 
normal order of w(n) is loglogn. 

The paper of Hardy and Ramanujan laid the foundations of a new subject called 
probabilistic number theory. Since then, the subject expanded into a major discipline 
at the hands of P. Erdds, M. Kac, J. Kubilius and P.D.T.A. Elliott, to name a few. 
In fact, by 1972, the subject became so enormous that Elliott wrote a two-volume 
treatise explaining the developments since the time of the Hardy—Ramanujan paper. 
In the recent past, there have been further developments especially to the study of 
Fourier coefficients of modular forms. We summarize these at the end of this chapter. 

In their seminal paper, Hardy and Ramanujan make use of Brun’s sieve to prove 
that w(n) has normal order loglogn. In 1934, Turan [196] showed how one can 
derive the Hardy—Ramanujan theorem without Brun’s sieve and using what can be 
viewed as Tchebycheff’s inequality. Apparently, this paper of Turan was part of his 
doctoral thesis written under the direction of L. Féjer, and it seems that Turan was 
unaware of Tchebycheff’s inequality or probability theory. As the proof is quite 
simple, we reproduce it here. 
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It is fairly straightforward to show that 
a(n) = x loglogx + O(x) 
n<x 
and 


Y\ wn) = x(loglog.x)” + O(« log log x). 


n<x 


Putting these two together, we get 


S (win) — log logn)” = O(« log log x). 


n<x 


Thus, for any function 6 with 6(n) — oo as n > ov, we have the number of n < x 
for which |w(n) — loglogn| > 6(n)(loglogn)!/? is O(x/0(x)), which is o(x). 

One can actually sharpen this and create a sieve method. This was done in [113] 
and [115] and expanded to a combinatorial setting in [114]. The method therefore 
has immense potential and applications beyond number theory. 


2 The Erd6s—Kac Theorem 


It is interesting that Hardy and Ramanujan chose the word “normal” to describe 
(52) since, as it transpired later on, there is an intimate connection to the normal 
distribution in probability theory. To elaborate, let O(n) be any function tending to 
infinity as n tends to infinity. Hardy and Ramanujan proved that the number of n < x 
such that 


|(n) — loglogn| > 6(n) (log logn)'/? (53) 


is o(x) 
In 1940, Erdés and Kac [49] proved a sharper result. Namely, they showed that 
if 
w(n) — loglogn 
N =#in<x:a < ——__—__ =< ; 
) {usa = /loglogn = 


then 


N 1 B 2 
me =| et lay, 
x00 x /O1 a 


Thus, the distribution of values of @(n) has an intimate connection to the normal 
distribution. 

Erd6és and Kac [49] proved a more general theorem. To explain this, we re- 
call some definitions. An arithmetical function f is called additive if f(mn) = 
f(m) + f(n) for all coprime m,n. An additive function is called strongly additive 
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if f(p") = f(:~) for p prime and all natural numbers a. If f is a strongly additive 
function satisfying | f(p)| < 1 for all primes p, let 


An = I) 
psn P 
and 
2 1/2 
B, = ( f 2) 
pen P 
Let 
Np(a) =H] exces ie <6}. 
Then, 


N 1 B 
i — | eat. 
x-> CO x /2n a 


Later, Kubilius and Shapiro (independently) (see Theorem 12.2 of [46]) generalized 
the Erdés—Kac theorem by eliminating the condition that | f(p)| < 1 and replacing 
it with the condition that for each € > 0, 


1 2 
lim, > LY _ 6, 
noo B2 . D 

” ps<n;\f(p)|>€Bn 


It turns out that the same conclusion now holds. 


3 The Hardy—Ramanujan-Type Theorem for the t-Function 


The theorem of Deligne connecting the theory of €-adic representations to Fourier 
coefficients of Hecke eigenforms opened the door for obtaining many new results 
regarding the arithmetical nature of these coefficients. In particular, one can now 
study the number of prime factors of t(p) and other arithmetical functions aris- 
ing from modular forms. Unfortunately, to establish these results, some form of a 
quasi-Riemann hypothesis needs to be assumed for the non-abelian Artin L-series 
associated to the ¢-adic representation. We explain why. 

More precisely, this theorem says that if Gg is the absolute Galois group of Q 
over Q, then there exists an ¢-adic representation 


pe: Gq GLy(Zz), 


where Z, denotes the ring of ¢-adic integers such that if 0, denotes the Frobenius 
automorphism, then 


tr(pe(ap)) =t(p),  detpe(op)=p'', pH#é. 
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In particular, if we want to study the distribution of values of t(p) modulo £”, then 
this means that we can study the finite Galois extension Kg» and the representation, 


Pen: Gal(K en /Q) > GL2(Z/e"Z). 


Ribet has analyzed the image of this representation. It consists of matrices whose 
determinant is an 11th power (mod ¢”). 

Using this representation, the study of t(p) (mod £) is reduced to the study of 
the distribution of the Frobenius automorphism in Gal(K¢/Q). But this distribution 
is given by the Chebotarev density theorem. Assuming the GRH allows us to use an 
effective version of this theorem to control error terms that arise in the analysis. It 
turns out that the full strength of the GRH is not essential in such theorems. Rather, 
a quasi-GRH, which assumes a zero-free region for the zeta functions in some fixed 
half-plane to the left of i(s) = 1 is sufficient for our purpose. 

In [132], assuming a quasi-GRH, we showed that 


! 2 
~ (w(t (p)) — log log p) = O(x(x) log logx), 
psx 
where the dash on the summation indicates we sum over primes p for which 
t(p) #0. Under the same hypothesis, we proved that 


1 2 
Y (ot (n)) - 3 log log ny) <« x(log log x) (log log log log x). 


n<x 
In particular, for any € > 0, we have 


|o(t(p)) = log log p| < (log log p)i/te 
for all but o(x/logx) primes p < x. 

There is nothing particular about t(p) or t(”). One can establish similar results 
for Fourier coefficients of normalized Hecke eigenforms of arbitrary weight and 
level. These results can be viewed as the modular analogues of the classical theorem 
of Hardy and Ramanujan. 

In [133], we extended these results and showed that 


w(t(p)) — loglog p 


{p<x.c(p) 40:0 <6} ~x0@(@.p) 


(log log p)!/? 
where 
1 B 2 
(a, p)=—— | eat. 
J 2 a 
Also, 


1 2 
— 5(logl 
wn Sete w(t(n)) — 5(loglogn) 
(loglogn)3/2/./3 
with both theorems using a quasi-GRH. 


Of course, analogous results hold for Fourier coefficients of any Hecke eigenform 
with rational integer coefficients. 


= a ~xP(a, B), 
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4 Non-abelian Generalizations of the Hardy—-Ramanujan 
Theorem 


The methods used to derive modular analogues of the Hardy—Ramanujan theorem 
extend to the study of other arithmetical functions which are not necessarily Fourier 
coefficients of modular forms or functions related to an ¢-adic representation. In- 
deed, if we have a multiplicative function f such that the set of primes p for which 
£| f (p) can be described by Chebotarev conditions, then one can apply similar meth- 
ods to derive analogous results. 

A good example is the case where f(n) = ¢(n), Euler’s ¢-function. In this set- 
ting, one can study w(f(p)) which is the same as the study of w(p — 1), and one 
can also study w(@(n)). This was first done by the authors in [132]. Independently 
and later, Erdés and Pomerance [50] proved that the number of n < x for which 


_ O(n) = x (loglogn)? 
(log logn)3/?//3 
is asymptotically x®(a, 6). Here the use of the quasi-GRH is not needed, and one 
can use the Bombieri—Vinogradov theorem to make the result unconditional. 

For the method to work, it is not even essential to have a multiplicative function. 
To study the divisibility of any sequence of numbers, it is possible to apply these 
techniques as long as one can relate it again to the Chebotarev density theorem. We 
give one such example of this type of theorem. 

Let a be a natural number which is not equal to +1 or a perfect square. A classical 
conjecture of Artin predicts that a is a primitive root (mod p) for infinitely many 
primes p. This problem is still open though some spectacular progress has been 
made in the recent past. In this context, it is natural to study f,(p), the order of a 
mod p. Let f,(n) be the order of a (mod n) with n coprime to a. Under the quasi- 
GRH, M. Ram Murty and his student F. Saidak [136] showed that the number of 
primes p < x for which 


oy < 2 Lfa(p)) — loglog p 
a log log p 
is asymptotic to m(x)®(q, B). Similarly, under the same hypothesis, the number of 
n <x for which 


a 


1 2 
a < faln)) = ploglogny _ 
(loglogn)3/2/./3 
is asymptotic to x®(a, B) as x tends to infinity. 
Clearly, the normal order method is a powerful method to analyze the arithmetic 


structure of complicated arithmetical functions. Its potential has not yet been fully 
exhausted. 


Chapter 12 
The Sato—Tate Conjecture for the Ramanujan 
t-Function 


1 Introduction 


It was Serre [185] who first conjectured that the Sato—Tate distribution holds for the 
Ramanujan t-function. More precisely, he conjectured that the numbers 


T(p) 


2pii/2’ 


as one ranges over primes p, are equidistributed in the interval [—1, 1] with respect 
to the Sato—Tate measure 


=y 1—x?dx. 


In fact, Serre expected that an analogous result should hold for the Fourier coef- 
ficients of any normalized Hecke eigenform not of CM type. This was recently 
proved by Barnet-Lamb, Geraghty, Harris and Taylor [18]. In particular, this applies 
to Hecke eigenforms of weight 2 not of CM type. By the celebrated modularity 
conjecture, we know that elliptic curves correspond to Hecke eigenforms of weight 
2 with integer Fourier coefficients. Consequently, the “angles of Frobenius” for el- 
liptic curves not of CM type are distributed with respect to the Sato—Tate measure. 
This was originally proved for elliptic curves with non-integral j-invariant in [35], 
but the condition was removed in the most recent work [18]. 

In this chapter, we give an outline of the proof of this theorem. Since the pre- 
requisites are formidable, we aim only for conceptual clarity and arrange the main 
ideas for the appreciation and understanding of the reader. We also indicate how one 
can generalize the theorem so as to introduce additional constraints on the primes, 
such as primes in a fixed arithmetic progression. In particular, for any interval J in 
[—1, 1], we will show that for a given natural number g and a coprime to q, the 
density of primes p =a (mod q) with t(p)/2p!!/? € 1 is 


: [vireax, 
I 


Tq) 
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We begin with a historical description of the original conjecture by Sato and Tate 
in the context of elliptic curves. 
Consider the elliptic curve F defined by the equation 


yr =x +ax +b, a,beZ. 


Let A = —16(4a? + 27b7). As discussed in greater generality in Chap. 3, for each 
prime p with (p, A) = 1, we may consider the congruence 


y>=x3+ax+b (mod p) 


and count the number N, of solutions (x, y). This quantity was first studied by Emil 
Artin [10] in his 1924 doctoral thesis in which he conjectured that 


INp — PI S2/P (54) 


for all such primes. In many ways, his study was motivated by the classical Riemann 
hypothesis. To understand the nature of zeta functions in general, Artin defined the 
analogue of the Dedekind zeta function in the setting of a function field over a finite 
field. In the case of a quadratic extension of F ,(x) defined by 


y=xi+axt+b, 


the analogue of the Riemann hypothesis for the function field zeta function turns out 
to be equivalent to (54). In his thesis, Artin verified his conjecture for many small 
primes p but could not prove it. In February 1933, Hasse [74] proved the conjecture 
using techniques from algebraic geometry. One could say that understanding this 
function field analogue of the Riemann hypothesis was an important step in the 
annals of mathematics. The reader is referred to the historical document [54] for 
further discussions of this. 

Artin’s thesis was seminal in many ways. First, it opened up the study of alge- 
braic geometry over finite fields and connected it to the study of exponential sums 
that occur in classical analytic number theory. Second, it inspired Weil [201] to 
formulate in 1949 general conjectures that led Grothendieck [63] to chart out a vi- 
sionary program in algebraic geometry ultimately leading to a resolution of the Weil 
conjectures in the fundamental work of Deligne [39] in 1974. 

Around 1960, Mikio Sato and John Tate [193] (independently) asked about the 
distribution of the numbers 


Np— Pp 
JP 


in the interval [—2, 2] as p tends to infinity. For example, is it reasonable to expect 
that these numbers are uniformly distributed in the interval? In other words, is it true 
that for any interval [a, b] C [—2, 2], we have 


jam HUDSON = DY VELA, BI) _ 
x00 #{p <x} ~ 


b—a? 
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This question is the genesis of the Sato—Tate conjecture. Numerical evidence seemed 
to suggest otherwise. More precisely, Sato and Tate were led to predict that for a 
“generic” elliptic curve E, the following is true. If we write 


(Np — P)//p=2cos6p, O56) <7, 
and [a, 8] C [0, zr], then, their conjecture says 


# < :6 ’ 2 B _ 1 
{p<x:0,¢[o, Bl} _ / ey pr cde (sin28 — sin2a). 
x00 #{p <x} a we 2m 


The conjecture of Tate [193] on algebraic cycles also provided conceptual evi- 
dence for this. Here, “generic” means that the elliptic curve should be without com- 
plex multiplication (see [107] for details). In other words, E is said to be without 
complex multiplication (or without CM) if End E = Z. If the elliptic curve has com- 
plex multiplication, then the (essentially) uniform distribution law for the angles was 
worked out by Deuring [41] building on earlier work of Hecke [75, 76]. 

One can formulate a more general conjecture. Let E be an elliptic curve defined 
over a number field K. For each place v of K where E has good reduction, we 
may consider the group of points of EF mod v. Its cardinality (including the identity 
element) can be written as 


Nvu+l—-a, 


where Nv denotes the norm of v, and a, is an integer satisfying Hasse’s inequality 
|ay| < 2(Nv) 1/2. As before, one can therefore write 


ay = 2N(v)!/2 cos Oy, 


where 0,(E) := 0, satisfies 0 < 0, < 2. The Sato—Tate conjecture now is a state- 
ment about how the angles @, are distributed in the interval [0, 7] as v varies. When 
E has complex multiplication (CM), the distribution law is known and again follows 
from the classical work of Deuring on Hecke L-series (see [139] for details). In the 
non-CM case, one expects that the angles are uniformly distributed with respect to 
the measure 


lsr = ? (sin?) 40. 
IU 


On 18 March 2006, Taylor [194] (see also [35]) published a proof of this conjec- 
ture when E has at least one prime of multiplicative reduction. He was building on 
his earlier work with Clozel, Harris and Shepherd-Barron (see Carayol’s Séminaire 
Bourbaki article [30)]). 

Inspired by the Sato—Tate conjecture for elliptic curves, Serre [185] extended 
the conjecture for Fourier coefficients of normalized Hecke eigenforms. We review 
his formalism below and in the next sections give an axiomatic exposition that al- 
lows for an application of results by Barnet-Lamb, Geraghty, Harris and Taylor [18] 
as well as results pertaining to Hilbert modular forms by Barnet-Lamb, Gee and 
Geraghty [17]. These new developments all prove special cases of the following 
hypothesis. 
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Potential automorphy hypothesis Hx(S, 7,2) Let K be an algebraic number 
field, and z a cuspidal automorphic representation of GL2(Ax) not of CM type. 
Let S be a finite set of primes of K, and T a finite set of odd positive integers. Then, 
there exists a finite Galois extension Ks,7 over K unramified at S with Sym” (7) 
automorphic over Ks 7 for allm €T. 


Our first result is a hybrid Chebotarev—Sato—Tate theorem. 


Theorem 1.1 Let K be an algebraic number field. Let 1 be a cuspidal automorphic 
representation of GL2(Ax), which is not of CM type, M/K a finite solvable Galois 
extension with G = Gal(M/K), and C a conjugacy class of G. For every odd num- 
ber m, assume hypothesis Hx (S,T, 1) where S is the set of primes of K that ramify 
in M and T is either {m} or {1, m}. Then the density of prime ideals p for which the 
Artin symbol oy € C and the angle 0 € [a, B] withO<a<B <7 is 


2IC| £P 
T|G| Jo 


sin? 0d0. 


By the work of [18], the potential automorphy hypothesis is now a theorem in 
the case of classical normalized Hecke eigenforms not of CM type. In particular, we 
have the following corollary: 


Corollary 1.2 Let f be a normalized Hecke eigenform of weight k and arbitrary 
level, which is not of CM type. Write the pth Fourier coefficient of f as ap(f) = 
2p*—-Y/2 cos Oy(f). Let q be a natural number, and a an integer with (a, q) = 1. 
For 0 <a <6 <7, the density of primes p for which 6,(f) € [a, B] and p= 
a (mod q) is 


2 Png 
sine 0 dé. 
TP(G) Jo 


It is evident that by similar arguments, one can handle the joint distribution of 
angles of any finite set of Hecke eigenforms, provided that there is at most one 
eigenform in the set without CM. 

One can also study the joint distribution of angles of a finite collection of pairwise 
non-isogenous elliptic curves. This looks like a difficult question, and Harris has 
recently announced some progress in this direction. 

Though our treatment is informal, the background needed for a total understand- 
ing is quite formidable spanning representation theory, arithmetic algebraic geome- 
try, analytic and algebraic number theory. Still, we hope that the presentation given 
here will enable the non-expert to see how the proof is put together and appreciate 
the interplay of ideas. 

Recently, “friendly” exposés of the Sato—Tate conjecture have appeared in vari- 
ous places. The papers by Mazur [123, 124] are a good place to begin for the totally 
uninitiated reader. Here, our goal is more mathematical. We hope to give (with- 
out proofs) the main mathematical ingredients that enter into such equidistribution 
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questions so that the reader may have a conceptual understanding of the results. 

Because of the celebrated modularity conjecture (now proved by Wiles [204] and 
others [25]), one can deduce the original conjecture of Sato and Tate, at least in the 
case that F is defined over the rational number field. In the general number field 
case, it is still open whether the ays can be viewed as coming from automorphic 
representations as predicted by Langlands. Thus, one can view the Sato—Tate con- 
jecture for elliptic curves as a special case of a more general statement concerning 
distribution of eigenvalues of Hecke operators or in the general case, of Satake pa- 
rameters. (See [36] for more details.) One can formulate a function field analogue 
of the Sato—Tate conjecture, and this has been proved in various contexts. Let K 
be a rational function field in one variable over a finite field F, and let E denote 
an elliptic curve over K with nonconstant j-invariant. Let Y denote the projective 
line over F and consider the Néron model € —~ Y. This is a smooth group scheme 
whose general fibre is E and outside of a finite set S of points y € Y, the fibre €y at 
y is an elliptic curve (the “reduction” of E modulo the residue field corresponding 
to y). Thus, as an elliptic curve over a finite field, the zeta function of €, is of the 
form 


(l—ayT)(1—@T) 
(1—T) —q*89T) 


Here ay = q29)/2 6) where 0 < 0(y) <x. Let F,, denote the unique extension 
of F of degree n. Then, the Sato—Tate conjecture in this context says that as n —> 
oo, we have 


p 
#{y €Y(F,):a <0, <ph~ (| 2 ww?oan) 


This was proved by Yoshida in [205] and in a different way by K. Murty in [139]. 
Very general theorems of Sato—Tate type (in which the base Y is replaced by an 
arbitrary variety and € by families of £-adic sheaves) are proved in Deligne [40], 
Sect. 3.5. 


2 Weyl’s Criterion 


We will begin with a general discussion of the classical setting for uniform distribu- 
tion. A sequence of real numbers {x,} is called uniformly distributed (modulo 1) if 
for any pair of real numbers a, B with 0 <a < B < 1, we have 


#{n <N: xn € (a, B)} ~(B—a)N 


as N tends to infinity. 
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Theorem 2.1 (Weyl’s Criterion) The sequence {x,} is uniformly distributed mod 1 
if and only if for allm > 1, 


2. eomimxn =0(N) 


n<N 


as N tends to infinity. 


Proof (Sketch) First suppose that the sequence is uniformly distributed. We will 
show the condition is necessary. Let us observe that any continuous function f can 
be approximated by a linear combination of step functions so that for any given 
€ > 0, we have 


sup 
xe[0,1] 


<€, 


fx) — do eix,() 


where x, denotes the characteristic function of an interval J. Then, 
yf eu= Ya peer: cn) + O(N). 
n<N i n<N 

By hypothesis, 


Yo xi Gn) = U)N + 0(N), 
n<N 


where ,1(/) denotes the measure of an interval J. Now the sum 
Soci) 
i 


is a Riemann sum, and as our € gets smaller, the sum converges to the integral 


1 
/ f(x) dx. 
0 


1 1 
— 7 dx. 
5 Sten > f f(x)dx 


n<N 


Thus, 


In particular, we can apply this to cosmx and sinmx to deduce the required result. 

For the converse, we approximate x;(x) by trigonometric polynomials (which 
can be done by the Stone—Weierstrass theorem). In fact, one can be more precise. 
For any positive integer K, there are trigonometric polynomials m(x) and M(x) of 
degree < K such that 


m(x) < x1(x) <M) 
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with 
m(x)= > Geer, M(x) = > Der 
|m|<K |m|<K 
with 
agp = bop = WU) + OC/K). 
Therefore, 


#{n < Nix €T}= 7 x1@n) = M(DN + 0(N), 
n<N 


as required. 


Theorem 2.1 says that to establish uniform distribution of the angles 6,, we need 
to study the exponential sums 


> e2timoy ; 


N(v)<x 


Weyl’s theorem is the classical theorem regarding equidistribution for the group 
R/Z. Serre [185] extended the Weyl criterion for arbitrary compact groups and for- 
mulated an L-series formalism related to it. We review this in Sect. 4. In the next 
section, we review a classical Tauberian theorem that will be essential to our discus- 
sion. 


3 Wiener-—Ikehara Tauberian Theorem 


Theorem 3.1 Let f(s) = -7~., an/n* with ay > 0 and g(s) = ~~, bn /n* be two 
Dirichlet series with \|byj| < an for all n. Assume that f(s) and g(s) extend ana- 
lytically to R(s) > 1 except possibly at s = 1 where they have a simple pole with 


residues R and r (which may be zero) respectively. Then 


as x tends to infinity. 


The classical application of this theorem is the deduction of the prime number 
theorem. Let 


a — A(n) 
i=-7 = ae 


n=1 
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where A(n) = log p when n = p“ for some prime p and zero otherwise. Taking 
g(s) = f(s) in the above theorem allows us to deduce the prime number theorem 


y> A(n) ~ x 


n<x 


using the well-known fact that the Riemann zeta function does not vanish on 
Ns) = 1. 

In a fundamental paper written in 1970, Langlands [108] outlined an approach to 
the Sato—Tate conjecture using the theory of automorphic forms. (It is possible that 
some of these ideas may have had roots in the earlier work of Sato and Tate.) To 
simplify matters and notation, we will give only a rough outline of this approach. 

Firstly, Langlands suggested the automorphic viewpoint. Thus, the conjecture of 
Sato-Tate was applicable in a larger context of modular forms, or more generally, 
to automorphic forms on GL(2). For example, one could take the celebrated Ra- 
manujan t-function attached to the unique newform of weight 12 and level 1, and 
write 


T(p) = ape COS Oy. 


One expects the same Sato—Tate distribution for these angles @, as well. 
Here is a brief description of the strategy of Langlands [108]. For each natural 
number m, put 


where a, =e! and B, = e~'”, Langlands indicated that the theory of automorphic 


forms predicts that each L,,(s) should extend to an entire function. In fact, if each 
Lm(s) extends analytically for %i(s) > 1 and does not vanish there, then by the 
Tauberian theorem, we deduce for m > 1, 


sin(m + 1)0, 
> en) ae => o(x(x)) 


Nv<x 


as x — oo. Having this for each m, we will show in the next section that the Sato— 
Tate conjecture follows. Kumar Murty [139] showed that the non-vanishing hypoth- 
esis can be dispensed with because a very elegant argument extending the classical 
one of Hadamard and de la Vallée Poussin allows one to show non-vanishing from 
having analytic continuation to Xt(s) > 1. 

In the case F is the rational number field, it is now a theorem due to Wiles 
and others that L;(s) is essentially the L-function attached by Hecke to a classical 
cusp form of weight 2. Thus, in this particular case, the Langlands conjecture is 
established. The non-vanishing of L1(s) on R(s) = 1 is a result due to Rankin. 
For m = 2, Rankin-Selberg theory allows one to deduce that L2(s) extends to an 
entire function for i(s) > 1. The continuation of L2(s) to the entire complex plane 
was established by Shimura [188] in the case F = Q and in the general case by 
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Gelbart and Jacquet [58]. In very recent work, Kim and Shahidi [96] showed that 
L3(s) extends to an entire function, and later, Kim showed the same for La(s). 
For the cases 5 < m < 9, Kim and Shahidi have shown that L,,(s) extends to a 
meromorphic function for all s €¢ C which is regular for t(s) > 1, except in the 
case of m = 9 where Lo(s) may have a pole at s = 1. If Lo(s) were to have a pole 
at s = 1, then the Sato—Tate conjecture would be false, as we will indicate below. 
Thus, we can go in the reverse direction. The truth of the Sato—Tate conjecture now 
established implies that Lo(s) does not have a pole at s = 1. 


4 Weyl’s Theorem for Compact Groups 


Serre [185] gave the following reformulation of the Wey] criterion for uniform dis- 
tribution in the context of a compact group. Let G be a compact group, and X its 
space of conjugacy classes. Let jz denote its normalized Haar measure. A sequence 
of elements {x,} with x, € X is said to be uniformly distributed in X if for every 
continuous function f, we have 


Y fon~n f tay 


n<N 


as N tends to infinity. 


Theorem 4.1 (Weyl’s criterion for compact groups) Let G be a compact group with 
Haar measure 1. A sequence {Xn} is uniformly distributed in G if and only if 


Yo x Gin) = 0(N) 


n<N 


for every irreducible character x of G. 


The classical case in Theorem 2.1 corresponds to G = R/Z because in this case, 
the irreducible characters are given by x b> e277!" , 

Serre gave an interesting reformulation of this criterion in the context of L- 
functions. Let F be a field and for each place v of F, and let x, € G. For each 
irreducible representation p : G > GL,(C), we let 

L(s, p) =| [det(I — p(x,)Nv) 


Vv 


Theorem 4.2 (Serre) Suppose that for each irreducible non-trivial representation p 
of G, the L-function L(s, p) extends to an analytic function for t(s) => 1. Then, the 
sequence {xy} is uniformly distributed in X if and only if L(s, p) does not vanish on 
R(s) = | for all irreducible p. 
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In the context of the Sato—Tate conjecture, one considers the group SU(2, C) 
where the conjugacy classes are parameterized by 


10 
Y= (% es): 0<6 <n. 


The image of the Haar measure in the space of conjugacy classes of SU(2, C) is 
known to be 


2 9 
—sin“0dé. 
W 


The irreducible representations of SU(2, C) are the symmetric power representa- 
tions P, of the standard representation o; of SU(2, C) into GL(2, C). We find that 
L(s, Pm) as defined above by Serre coincide with L,,(s) defined in Sect. 3. 

Since tr Pm (Xo) = sin@m + 1)6/ sind, the Sato—Tate conjecture is equivalent to 
the assertion 

3 sin(m + 1)0, = Nore (x)) 


sin 0, 
N(v)<x 


for each natural number m. So far, this has been established only for m < 8 by the 
work of Kim and Shahidi [96]. 


5 Symmetric Power L-Series of Elliptic Curves 


Let K be an algebraic number field. Let E be an elliptic curve defined over K. Let 
S be the (finite) set of places where E has bad reduction. For each finite place v ¢ S 
of K, we know that the number of points on E mod v is given by 


N(v) +1 — ay, 
where dy is an integer satisfying Hasse’s inequality |ay| < 2N(v)!/?. As in Chap. 3, 


for each prime £, the action of Gal(K /K) on the ¢-adic Tate module gives rise to an 
£-adic representation 


p := pe: Gal(K/K) > GL2(Q.) 
which is integral, that is, the characteristic polynomial of p¢(F,) (where F, denotes 
the Frobenius automorphism at v ¢ S) has integer coefficients, independent of ¢. In 


fact, this characteristic polynomial is X? —a,X + N(v). Let us write a,N(v)!/?, 
ByN(v)!/? for the two roots of the quadratic polynomial 


X? =a, ¥ 4 NG. 
The (partial) mth symmetric power L-function is defined as 
m 
Ls(s, Sym” p) = I] I] (1 - ol p” INC). 
v¢S j=0 


Clearly, the product converges absolutely for 3i(s) > 1. 
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In 1970, Langlands [108] suggested the existence of an automorphic represen- 
tation 7, attached to GLy+1(Ax), where Ax denotes the adele ring of K, such 
that Ls(s, tn) = Ls(s, Sym’ (p)) where Ls(s, 2) is the automorphic L-function 
attached to 7, with the Euler factors corresponding to the places v € S removed. 
In fact, it is conjectured that one can define the local factors for v € S in such a 
way that the completed L-function, L(s, Sym’(p)) (which we shall abbreviate as 
Lm(s)), satisfies a functional equation relating s to 1 — s. (See, for example, [37] 
for details.) 

If the Langlands conjecture about the existence of 7, is true, then by the the- 
ory of automorphic representations, one immediately has analytic continuation of 
Ls(s, Sym” (¢)) to the entire complex plane. and by the result of K. Murty [139], 
the non-vanishing on the line t(s) = 1 follows, and the Sato—Tate conjecture fol- 
lows. The non-vanishing of the L-function on the line $¢(s) = 1 can also be deduced 
from a celebrated result of Jacquet and Shalika [88], who showed that for any au- 
tomorphic representation 77, we have L(s, 2) £0 for {t(s) = 1. So, what is known 
about the analytic continuation of Ls(s, Sym” (p))? For m = 1, this is the (partial) 
Hasse—Weil L-series attached to the elliptic curve E. It is possible to define the 
Euler factors at places in S as well, so that the completed L-function conjecturally 
admits an analytic continuation to the entire complex plane and satisfies a suitable 
functional equation. In the case K = Q, the Taniyama conjecture, proved by Wiles 
(in the semistable case) [204] and by Breuil, Conrad, Diamond and Taylor (in the 
general case) [25], asserts that there is a classical cusp form f of weight 2 and level 
N (the conductor of E) such that the Hecke L-series L(s, f) attached to f agrees 
with L}(s — 1/2). If wy is the automorphic representation associated to f, then, in 
the context of the Langlands program, we have L(s, 7) = L1(s). The series Lj(s) 
is essentially the L-function attached to , coming from the action of Gal(K /K) on 
the Tate module. 

More generally, L;,(s) is essentially the L-function attached to the representation 
Sym (po), which comes from the action of Gal(K / K). As alluded to above, one 
expects the existence of an automorphic representation 7, attached to GLm+1(Ax) 
satisfying L(s, 1m) = Lm(s). This expectation is far from being realized, though 
important advances have been made in this direction. 

What Taylor proves is not the automorphy of L,,(s) but rather its “potential au- 
tomorphy.” This fact, combined with other results in the analytic theory of auto- 
morphic L-functions, leads to the Sato—Tate conjecture. This result of “potential 
automorphy” builds on a massive collection of earlier work that can be traced back 
to the celebrated conjecture of Serre. 

Serre [184] formulated a general conjecture about representations 


p : Gal(Q/Q) > GLy(F) 


with F a finite field, which are odd and absolutely irreducible. More precisely, he 
predicted that such representations arise from classical modular forms. Serre [184] 
showed that his conjecture implies the Taniyama conjecture. On the other hand, 
Frey [55] had a remarkable insight which was completed and made more precise 
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by Ribet [172], who showed that the Taniyama conjecture implies Fermat’s last 
theorem. Thus, Serre’s conjecture offered a new approach to Fermat. Wiles [204] 
proved an important case of Serre’s conjecture that was enough to deduce that of 
Taniyama. This work led to further developments. Most recently, Chandrashekhar 
Khare [93] proved the level 1 case of the Serre conjecture. In very recent work, 
Khare and Wintenberger [94] have settled the general case for which they were 
awarded the Cole Prize by the American Mathematical Society in 2011. This last 
development not only gives a new proof of Fermat’s last theorem, but it also implies 
the strong Artin conjecture [9] for two-dimensional Galois representations of odd 
conductor. 

In his recent papers, Taylor [194] made substantial progress towards this conjec- 
ture. If K is a totally real field and m is odd, he showed that there is a finite, totally 
real Galois extension L/K such that (Sym” p) restricted to L is automorphic over 
L. One can choose an L that works simultaneously for any finite set of odd numbers. 
One can also choose it to be unramified at any finite set of places. Once this theorem 
is in hand, Taylor uses standard results from the theory of automorphic L-functions 
to deduce the Sato—Tate conjecture. We will give an outline of this deduction below. 

The key result is the potential automorphy property of the symmetric power of 
a cuspidal automorphic representation, now established for a large class of such 
representations. 

First, let us review some essential theorems from the theory of automorphic L- 
functions. We refer the reader to [130] for details, definitions and additional refer- 
ences to the literature. More precisely, we highlight pages 119 and 215 of [9] for the 
exact definitions of the notions of base change and automorphic induction. Here are 
the key theorems we will need. 

First is the theorem of base change and automorphic induction, due to Arthur and 
Clozel [9]. This says the following. Suppose that L/K is a cyclic extension and z, 
IT are cuspidal representations of GL, (Ax) and GL, (Az) respectively. Then, the 
base change of 2, denoted B(zr), and the automorphic induction 7 (/7) of IT exist. 

The second fact we need is a celebrated theorem of Jacquet and Shalika [88], 
which states that for any unitary cuspidal automorphic representation 2 of 
GL, (Ax), we have L(1 + it, 2) 40. 

The third fact needed is a non-vanishing theorem due to Shahidi [187]. This 
states that the Rankin—Selberg L-function L(s, 2, x m2) does not vanish on the line 
3t(s) = | whenever zr, and zr2 are unitary cuspidal automorphic representations. 

A fourth fact needed is the Artin reciprocity law which states that every abelian 
Artin L-function of any Galois extension of K is a Hecke L-function and corre- 
sponds to a cuspidal automorphic representation of GL; (Ax). 


6 An Outline of the Proof of the Sato—Tate Conjecture 


Here is a brief outline of the proofs in [17], [18] and [194] of the Sato—Tate conjec- 
ture. Their main theorem is: let K be a totally real field, and E/K an elliptic curve 
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with multiplicative reduction at some prime. For any odd number m, there is a finite, 
totally real Galois extension L/K such that Sym’ p becomes automorphic over L. 
In other words, (Sym” ¢)|z is automorphic over L. (One can also choose an L that 
will work simultaneously for any finite set of odd positive numbers.) 

From this result they deduce the Sato—Tate conjecture in three steps. Here is an 
outline. 


Step 1: For any intermediate field K C F C L with L/F solvable, Sym’ pe is 
automorphic over F’. In other words, (Sym” p)|- is automorphic over F for every 
F with L/F solvable. 

This is proved in his earlier paper [73]. 

Essentially, it applies the Arthur—Clozel theory of base change, the key idea being 
that the base change lift of Sym” p to L, which exists by Taylor’s main theorem, is 
Galois invariant and so must be the base change lift of an automorphic representation 
wr for every intermediate F with L/F solvable. This is because one can find a chain 


F=FoC KR CRhC + CFn=L 


of extensions so that F;+41/F; is cyclic for 0 <i <m — 1 and apply the Arthur— 
Clozel theorem for automorphic induction successively, in stages, to each of the 
cyclic extensions Fin /Fim—1,..., F\/ Fo. We refer the reader to [9] for precise details 
concerning automorphic induction. 


Step 2: Let G = Gal(L/K). Now apply Brauer induction to write 


1= Ly aj Ind§, Wi 
i 

with a; integers and yw; one-dimensional characters of nilpotent subgroups Hj of G. 
Then, 

L(s, (Sym” p) @ 1) oe (Sym” p) @ Indf, vi)” 
By Frobenius reciprocity, 

(Sym” p) ® Indf, Wi= Indy, ((Sym” p)|, 4, ® Vi). 
By Step 1, (Sym p)|,#; is automorphic over L¥i, By Artin reciprocity, yy; is 


a Hecke character x; of L#. Thus, (Sym” p)| i; © Wi is automorphic over L*, 
By invariance of L-series under induction, we deduce 


L(s, Sym” p) = 11 (Sym” p) | 1H; @ xi)", 


and the product on the right-hand side, being a product of automorphic L-functions 
by Step 1, represents a meromorphic function of s. In this way, one derives the 
meromorphic continuation of the odd symmetric power L-functions attached to E. 
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Step 3: If we apply the Jacquet-Shalika theorem which assures us that there are 
no poles on {i(s) = 1 for a cuspidal automorphic L-function L(s, 77), as well as 
the non-vanishing of L(s, 2) on t(s) = 1 for any automorphic representation 7, 
we obtain from the above product the analytic continuation and non-vanishing on 
R(s) = 1 of L(s, Sym” pe) for m odd. To treat m even, one uses induction and the 
identity 


Sym”! p® Sym”*! p= Sym” p® Sym! dD, 


which is essentially the trigonometric identity 


sinm6é 4: sin(m +2)0 _ (FS + *) (SF) 


sind sin 0 sin 0 sin 0 


(or the Clebsch-Gordon branching rule for SZ). Thus, 
L(s, Sym”! p)L(s, Sym™*! p) = L(s, (Sym” p) ® Sym! Pp); 


Now apply Step 1 with the two odd numbers 1, m to get the base change to L 
of both Sym” p and Sym! p automorphic. By the same Brauer induction trick of 
Step 2 applied to the right-hand side, one deduces that the right-hand side has a 
meromorphic continuation for all complex s. To get analytic continuation to R(s) = 
1, one needs to apply Shahidi’s results on the non-vanishing of Rankin—Selberg L- 
functions which appear on the right-hand side. Poles on the line t(s) = | can also 
be ruled out by the same theory. This completes the proof. 

In many respects, this is the elliptic analogue of Brauer’s theorem of the mero- 
morphy of Artin L-series. As can be seen, the non-vanishing of the L-series on 
the line i(s) = | is essential in the proof. This was ensured by an application of 
theorems of Jacquet, Shalika and Shahidi. The Jacquet—Shalika theorem and the 
Shahidi theorem rely on the theory of Eisenstein series (a la Langlands). There are 
other ways of establishing non-vanishing of the L-series concerned without using 
the theory of Eisenstein series. Indeed, if one is willing to admit Rankin—Selberg the- 
ory and existence and analyticity of these L-functions, then classical non-vanishing 
techniques (of the type used by Hadamard and de la Vallée Poussin) actually work. 
This method is outlined in a recent paper of Sarnak [176]. 


7 A Chebotarev—Sato—Tate Theorem and Generalizations 


In this section, we indicate the proof of Theorem 1.1. Before doing so, we discuss 
some natural generalizations of the Sato—Tate conjecture that one can consider. We 
indicate briefly what can actually be proved. Firstly, we can take two non-isogenous 
elliptic curves and consider the joint distribution of the angles. For instance, if Ey 
and E> are non-isogenous elliptic curves, both without CM, defined over Q, and 
0,(E1) and 0,(E2) are the angles respectively, it is reasonable to expect that the 
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distribution of the pair of angles (9,(£1), 6,(E2)) is given by the product distribu- 
tion 

ee ee 
=) sin“ 0; sin“ 07 d@; dO. 
More generally, one can consider two distinct non-CM automorphic representations 
and discuss their joint distribution. One would, of course, expect a similar distribu- 
tion in the general case. 

To prove such an assertion, we can use the formalism of Serre [185]. Using the 
formalism of [185], it is not difficult to show that this involves the study of certain L- 
series. Indeed, if both curves are non-CM and have associated Galois representations 
fp and p2 respectively, then one needs to show that the L-series 


L(s, Sym”! (p1) ® Sym”? (p2)) 


extends to (s) > 1 and does not vanish there. This looks like a difficult question to 
answer with the present state of knowledge. However, Harris has recently announced 
some progress in this direction. 

If however, one of the curves has CM and corresponds to a Hecke character y, 
then one needs to study the L-series 


L(s, w" @ Sym'"2(p2)) 


and establish analytic continuation and non-vanishing in the region {i(s) > 1. This 
can be done since only Hecke characters intervene, and these can be base-changed 
to any field by a well-known theorem of Weil [200]. It is also clear that one can take 
any number of CM elliptic curves and derive a similar theorem for the same reasons. 


Proof of Theorem 1.1 By standard Tauberian theory, as discussed in [185], we need 
to show that for any irreducible representation t of G = Gal(M/K), the L-function 


L(s,t @ Sym" x) 


is analytic and non-vanishing in the region t(s) > 1. Using Brauer induction, we 
write 


T= » a Ind§, di; 
i 
where c; are integers, and ¢; is an abelian character of H;, with H; certain nilpotent 
subgroups of G. By our hypothesis, we can choose L so that L and M are disjoint 


and for any odd m, Sym” zr is automorphic over L. In particular, Gal(LM/L) = G 
and 


L(s, Sym" x ®t) =| | L(s, Sym” x @ Indf, $i)". 


L 
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Since we are viewing G as the Galois group of LM/L, we can re-write this, by 
Frobenius reciprocity, as 


[THs (Ind§, ( (Sym” It) egy @vi))", 


where yf; is the Hecke character corresponding to ¢; via Artin reciprocity. As 
(Sym” p)|_ is automorphic by our hypothesis, (Sym z)|_u is automorphic by 
the theory of base change applied to the solvable extension LM/L. As in Step 1 
of Taylor’s theorem, we deduce that (Sym’” zr)| (Lm)Hi 1S automorphic over (LM yi 
by an application of the Arthur—Clozel theory. Since yy; is a Hecke character, we 
deduce that 


(Sym” 1) lume @ Vi 


is automorphic over (LM)"'. Consequently, by the invariance of L-series under 
induction, we deduce that 


L(s, Indf, ((Sym” zr) | uy ® Vi) 
is automorphic. Thus, it is analytic and non-vanishing for Si(s) > 1. This proves that 
TAS, (Sym” r) 2) x) 


extends to an analytic function for ‘i(s) > 1 and does not vanish there. This proves 
the required assertion for m odd. For m even, we proceed as before, by induction to 
obtain the desired result. This completes the proof of Theorem 1.1. 


8 Concluding Remarks 


We conclude this section with an alternate argument in the case that M/K is a nilpo- 
tent Galois extension which is simpler. In future variations, this alternate argument 
may be useful. 

By standard Tauberian theory, as discussed in [185], we need to show that for any 
irreducible representation t of Gal(K /K), with nilpotent image, the L-function 


L(s, t @ Sym” 1) 


is analytic and non-vanishing in the region t(s) > 1. Since any irreducible repre- 
sentation of a finite nilpotent group is induced from an abelian character x of a 
subgroup H, so 


Tt @ Sym” (sr) = Ind¥ (x @ Sym” (71)| pH). 


By Arthur—Clozel, x ® Sym” (zr)|, is automorphic. We now complete the proof 
as before. It is clear from the preceding arguments that if one had the automorphic 
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induction of Hecke characters, the proof would go through for any Galois setting 
and not just in the nilpotent or solvable setting. Future advances in the Langlands 
program should translate into general theorems of Chebotarev—Sato—Tate type. 

Just as in the classical case of the prime number theorem, one can ask about what 
error terms we may expect. It may be too early here to make statements that arise 
from the recent progress on potential automorphy. One would need more precise 
information about these fields Ks 7 before such error terms can be written down. 

There is an interesting conjecture due to Akiyama and Tanigawa [5] predicting 
that for any € > 0, 


#{(p <x: 0, €[a, Bl} _ ah 9 -1/2+¢ 
Hp =ai ee 6d0+ O(x ). (55) 


But this was already proved by Kumar Murty [140] in 1985 assuming that the Ln (s) 
satisfy the analogue of the Riemann hypothesis. In fact, this result makes explicit 
the dependence of the error term on the length of the interval [a, 6]. One can also 
go in the reverse direction. Assertion (55) is already a sufficiently strong conjecture, 
and one can use standard methods to deduce that it implies that every L,,(s) satisfies 
the analogue of the Riemann hypothesis for m > 1. 


ERRATUM 


Erratum to: The Ramanujan t-Function 


Erratum to: M.R. Murty, V.K. Murty, The Mathematical Legacy 
of Srinivasa Ramanujan, pp. 11-23, 
DOI 10.1007/978-81-322-0770-2_2, 
© Springer India 2013 


Page 17: The third displayed equation should be a congruence (mod 2). It is clearer 
if we note (1 — q”)*4 = (1 — q®")? (mod 2) and observe that 


[o,e) [o,e) 


TTa _ gy = yin” (2m + 1)q4m 4m = qin t4m (mod 2). 


n=1 m=0 


The online version of the original chapter can be found at doi: 10.1007/978-8 1-322-0770-2_2. 


MLR. Murty, V.K. Murty, The Mathematical Legacy of Srinivasa Ramanujan, El 
DOI 10.1007/978-8 1-322-0770-2_13, © Springer India 2013 
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